LLM Explainer Series
LLM Explainer Series
A plain-language series unpacking why modern models use Transformers, what attention does, how LLMs are trained, and how we should evaluate them.
Phase 1 · Transformer fundamentals
01What problem did the Transformer actually solve?
From the sequential limits of RNNs to attention and parallel training.
Attention: how does a model decide where to look?
An intuitive explanation of Query, Key, Value, and attention weights.
Transformer Block: the building block of LLMs
Multi-Head Attention, FFN, residual connections, and LayerNorm — what each component actually does.