# Visualization of self-attention maps in vision

[https://epfml.github.io/attention-cnn](https://epfml.github.io/attention-cnn)

# BertViz: Visualization of attention in NLP models

[https://github.com/jessevig/bertviz](https://github.com/jessevig/bertviz)

# Visualization of RNNs

[https://distill.pub/2019/memorization-in-rnns](https://distill.pub/2019/memorization-in-rnns)

# Peter Bloem blog on Transformers

[https://peterbloem.nl/blog/transformers](https://peterbloem.nl/blog/transformers)

# Harvard NLP on Transformers

[https://nlp.seas.harvard.edu/2018/04/03/attention.html](https://nlp.seas.harvard.edu/2018/04/03/attention.html)

# Explained Transformers

[https://e2eml.school/transformers.html](https://e2eml.school/transformers.html)

# Loss landscapes artistic visualization

[https://losslandscape.com/](https://losslandscape.com/)

# Dylan Patel blog on the semiconductor industry and AI

[https://semianalysis.com](https://semianalysis.com)

# Ars Technica on the latest interpretability attempts for LLMs

[https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/](https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/)

# Related sources and news

- https://www.reuters.com/legal/litigation/judge-pares-down-artists-ai-copyright-lawsuit-against-midjourney-stability-ai-2023-10-30
- https://analyticsindiamag.com/is-ai-fast-becoming-a-technology-built-on-worker-exploitation-from-global-south/
- https://www.businessinsider.com/openai-kenyan-contract-workers-label-toxic-content-chatgpt-training-report-2023-1
- https://www.artisana.ai/articles/gpt-4-outperforms-elite-crowdworkers-saving-researchers-usd500-000-and-20

# Papers

- [An image is worth 16x16 words](https://arxiv.org/abs/2010.11929)
- [GraphCast: Learning skillful medium-range global weather forecasting](https://arxiv.org/abs/2212.12794)
- [UNETR](https://arxiv.org/abs/2103.10504)
- [DALL-E hidden language](https://arxiv.org/abs/2206.00169)
- [Word2Vec](https://arxiv.org/abs/1310.4546)
- [Language models represent space and time](https://arxiv.org/abs/2310.02207)
- [Interpretability in the wild](https://arxiv.org/abs/2211.00593)
- [Scaling laws](https://arxiv.org/abs/2001.08361)
- [Generative models as complex systems](https://arxiv.org/abs/2308.00189)
- [Formal algorithms for Transformers](https://arxiv.org/abs/2207.09238)