Exploring Next
Exploring Next — Ep 353 w/ Justy & Cody — DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
Justy and Cody dig into DV-World, a new benchmark from a multi-institution research team that stress-tests AI data visualization agents on real-world tasks: spreadsheet manipulation, cross-framework chart evolution, and handling ambiguous user intent. Even the best models top out around 50%, which shows how far these agents still are from handling real-world visualization work reliably.