# Transformer Models ## Learning Resources [Review of the Attention Is All You Need paper](https://www.youtube.com/watch?v=iDulhoQ2pro) [The Illustrated Transformer](https://jalammar.github.io/illustrated-transformer/) [Deconstructing BERT, Part 2: Visualizing the Inner Workings of Attention](https://towardsdatascience.com/deconstructing-bert-part-2-visualizing-the-inner-workings-of-attention-60a16d86b5c1)