Qwen 3.7 Max Preview: What Alibaba's New AI Gets Right and Where It Falls Short Decrypt
Justy and Cody react to Alibaba's Qwen 3.7 Max preview on Arena AI: its surprise rankings (#13 text, #5 vision globally), the open/closed strategy (Plus open, Max proprietary), and a wild creative-writing test where Qwen nailed Caribbean cultural depth. Cody questions the consistency of crowd-sourced rankings, Justy sees a market signal for non-Western developers. They tease the timing (preview lands five days before Alibaba Cloud Summit) and the model’s 'deep thinking mode' preview limits.
Script: Mistral Medium 3.5 128B Voice: Inworld TTS 1.5 Mini
Transcript
Justy Okay, Alibaba just dropped Qwen 3.7 Max on Arena AI—five days before their Cloud Summit.
Cody Of course they did.
Justy It’s number thirteen globally in text, sixth lab overall. And Cody, it’s not just hype—
Cody Right.
Justy they’re validating on blind user tests first. Arena’s crowd-sourced, so the rankings mean something.
Cody Yeah, smarter than slapping a benchmark score on a press release.
Justy Also, Plus is open source, Max is paywalled. Classic Alibaba—dangle the good stuff, then monetize.
Cody I mean, they killed the free tier of Qwen Code last month.
Justy I know, I know. Anyway—this thing nailed the Decrypt creative test.
Cody Mm-hm.
Justy Same prompt as MiMo. Time travel, cultural background, paradox. Qwen went full Caribbean—Neo-Borinquen, Taíno spirits, Yemayá references. Like it actually GETS the cultural depth.
Cody Okay, that’s not nothing.
Justy MiMo’s prose was lush, sure, but Qwen’s opening was tight and specific. That’s a real signal for devs outside the usual Western bubble.
Cody Hm. Still, it’s a preview—deep thinking mode only, no web search, no code interpreter.
Justy Yeah, but the Summit was May twentieth. This was the appetizer.
Cody And now we wait for the main course.
Justy Cody you’re impossible.
Justy Fine. So who actually cares? If you’re building for non-English markets or you need agentic coding models that run locally—
Cody Or you just like a model that doesn’t butcher your cultural references—
Justy Exactly. That’s the take.
Cody I still think Arena rankings are noisy.
Justy Noisier than benchmarks?
Cody …Fair.
Justy Anyway. Exploring Next episode four twenty-one, done.