Exploring Next

Exploring Next — Ep 153 w/ Justy & Cody — Linear representations in language models can change dramatically over a conversation

This episode dives into the significant findings of recent research on how language models adjust their internal representations during conversations. We explore the implications of these changes for developers and practitioners in AI, discuss potential applications, and highlight the challenges they present for interpretability and reliability in AI outputs.

Open source article

Full episode page with transcript →

Browse all Exploring Next episodes →