Exploring Next
Exploring Next — Ep 286 w/ Justy & Cody — Databricks tested a stronger model against its multi-step agent on hybrid queries. The stronger model still lost by 21%.
Databricks' research shows multi-step agents outperform single-turn RAG systems on hybrid queries, achieving gains of 20% or more on Stanford's STaRK benchmark suite.