Exploring Next

Exploring Next — Ep 309 w/ Justy & Cody — 6 Things I Learned Building LLMs From Scratch That No Tutorial Teaches You | Towards Data Science

Justy and Cody dig into what actually changes when you stop calling an LLM API and start building pieces yourself: why fine-tuning tricks like RsLoRA matter, why RoPE won, where weight tying still makes sense, why Pre-LN became the default, and how KV cache buys speed by spending memory.

Open source article

Full episode page with transcript →

Browse all Exploring Next episodes →