🛠️ What's Included

- Complete architecture tutorial (RoPE, Attention, Transformer blocks)
- Training pipeline (data loading, optimization, checkpointing)
- Text generation (sampling, temperature, top-p)

...
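
To make the text generation item concrete, here is a minimal sketch of temperature scaling followed by top-p (nucleus) sampling over a list of logits. It is not taken from this project's code: the function name `sample_top_p`, its parameters, and the default values are assumptions chosen for illustration, and it uses only the Python standard library rather than any particular tensor framework.

```python
import math
import random


def sample_top_p(logits, temperature=1.0, top_p=0.9, rng=None):
    """Sample one token index using temperature scaling and top-p filtering.

    Hypothetical helper for illustration; not the project's actual API.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if rng is None:
        rng = random.Random()

    # Temperature scaling: values below 1 sharpen the distribution,
    # values above 1 flatten it.
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax (subtract the max before exponentiating).
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the smallest set of most-likely tokens whose cumulative
    # probability reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Draw from the kept tokens; random.choices normalizes the weights.
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0]


if __name__ == "__main__":
    example_logits = [2.0, 1.0, 0.5, -1.0]  # made-up values
    print(sample_top_p(example_logits, temperature=0.8, top_p=0.9))
```

Lower `top_p` values restrict sampling to fewer high-probability tokens, while lower `temperature` values concentrate probability on the top tokens before the cutoff is applied. A real generation loop would call a function like this once per step on the model's final-position logits.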