GPT-2 Small

Input Embeddings
Layer 0
Attention Heads
Feed Forward
Layer 1
Attention Heads
Feed Forward
Layer 2
Attention Heads
Feed Forward
Layer 3
Attention Heads
Feed Forward
Layer 4
Attention Heads
Feed Forward
Layer 5
Attention Heads
Feed Forward
Layer 6
Attention Heads
Feed Forward
Layer 7
Attention Heads
Feed Forward
Layer 8
Attention Heads
Feed Forward
Layer 9
Attention Heads
Feed Forward
Layer 10
Attention Heads
Feed Forward
Layer 11
Attention Heads
Feed Forward
Output (Logits)