Exploring Next

Exploring Next — Ep 286 w/ Justy & Cody — Databricks tested a stronger model against its multi-step agent on hybrid queries. The stronger model still lost by 21%.

Databricks' research shows multi-step agents outperform single-turn RAG systems on hybrid queries, achieving gains of 20% or more on Stanford's STaRK benchmark suite.

Open source article

Full episode page with transcript →

Browse all Exploring Next episodes →