Ep 26 Blog November 21, 2025 1:00 w/ Justy & Cody

Agentic Context Engineering Ace Self Improving LLMs via Evolving Contexts Not Fine Tuning

Tech News AI Paper Summary Technology Artificial Intelligence Editors Pick Machine Learning Staff Agentic Context Engineering (ACE): Self-Improving LLMs via Evolving Contexts, Not Fine-Tuning By Asif Razzaq - October 10, 2025 TL;DR : A team of researchers from Stanford University, SambaNova Systems and UC Berkeley introduce ACE framework that improves LLM performance by editing and growing the input context instead of updating model weights. Context is treated as a living “playbook” maintained by three roles— Generator, Reflector, Curator —with small delta items merged incrementally to avoid brevity bias and context collapse.

Read the source → Plain-text transcript →

Embed this episode

Paste this on any site — the player is a self-contained iframe with no cookies or trackers.

<iframe src="https://sandrise.io/exploring-next/embed/26"
  width="100%" height="180" style="max-width:640px;border:0;border-radius:12px;overflow:hidden"
  title="Exploring Next — Episode 26 audio player"
  loading="lazy" allow="autoplay" referrerpolicy="strict-origin-when-cross-origin"></iframe>

Embed & API docs →

Voice OpenAI TTS

Transcript

Host A Welcome back to Exploring Next! Today we're looking at marktechpost.com/2025/10/10/agentic-context-engineering-ace-self-improving-llms-via-evolving-contexts-not-fine-tuning.

Host B Yeah, this one caught our eye because Tech News AI Paper Summary Technology Artificial Intelligence Editors Pick Machine Learning Staff Agentic Context Engineering (ACE): Self-Improving LLMs via Evolving Contexts, Not Fine-Tuning By Asif Razzaq - October 10, 2025 TL;DR : A team of researchers from Stanford University, SambaNova Systems and UC Berkeley introduce ACE framework that improves LLM performance by editing and growing the input context instead of updating model weigh

Host A So the big idea is Context is treated as a living “playbook” maintained by three roles— Generator, Reflector, Curator —with small delta items merged incrementally to avoid brevity bias and context collapse.

Host B What stood out to me is Reported gains: +10.6% on AppWorld agent tasks, +8.6% on finance reasoning, and ~86.9% average latency reduction vs strong context-adaptation baselines.

Host A If you're curious, give the original a read: https://www.marktechpost.com/2025/10/10/agentic-context-engineering-ace-self-improving-llms-via-evolving-contexts-not-fine-tuning/.

Host B And let us know what you try next!