# Transformer Models
## Learning Resources
[Review of the Attention Is All You Need paper](https://www.youtube.com/watch?v=iDulhoQ2pro)
[The Illustrated Transformer](https://jalammar.github.io/illustrated-transformer/)
[Deconstructing BERT, Part 2: Visualizing the Inner Workings of Attention](https://towardsdatascience.com/deconstructing-bert-part-2-visualizing-the-inner-workings-of-attention-60a16d86b5c1)