Exploring Next

Exploring Next — Ep 353 w/ Justy & Cody — DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Justy and Cody dig into DV-World, a new benchmark from a multi-institution research team that stress-tests AI data visualization agents on real-world tasks — spreadsheet manipulation, cross-framework chart evolution, and handling ambiguous user intent. Even the best models top out around 50%, which tells you a lot about where the gap actually is.

Open source article

Full episode page with transcript →

Browse all Exploring Next episodes →