Transformer Architecture for LLMs

Maryam Bahrami
Artificial Intelligence in Plain English
5 min read · Dec 26, 2023

Transformers opened a new door in Natural Language Processing (NLP). They are the basis of LLMs, which outperform earlier NLP models such as RNNs. Google introduced the Transformer in 2017 in the paper "Attention Is All You Need". In this tutorial, I try to explain the architecture from that paper in a simpler way.

This is how I see Large Language Models (image created with DALL·E 2)

1. Attention Family
