Loading article…
Alibaba's Qwen 3.7 Max can run autonomous tasks for up to 35 hours, handle 1,000+ tool calls, and scores high on coding and reasoning benchmarks.
Alibaba introduced its newest flagship model, Qwen 3.7 Max, a proprietary AI designed for long‑horizon, multi‑step tasks such as coding projects and enterprise automation. The model can execute more than 1,000 tool calls in a single run and runs about ten times faster than its predecessor, according to Alibaba’s internal testing [1].
Key takeaways
Qwen 3.7 Max first appeared as a preview on the LM Arena leaderboard on May 14 and was formally announced at the Alibaba Cloud Summit in Hangzhou between May 19 and May 21 [1]. Alibaba credits iterative kernel optimizations for a ten‑fold speed improvement over earlier Qwen versions, enabling the model to sustain performance across hundreds or thousands of sequential actions—a capability it demonstrates by autonomously executing over 1,000 tool calls in internal tests [1].
The model’s endurance was highlighted in a demonstration where it ran for 35 hours on an isolated server with a T‑Head ZW‑M890 PPU, a hardware platform it had never seen during training. During this run it performed 1,158 distinct tool calls, evaluated 432 kernels, and achieved a 10.0× geometric‑mean speedup on an attention‑kernel optimization task [2]. Benchmark results show strong reasoning and coding abilities: 92.4 on GPQA Diamond (graduate‑level reasoning), 80.4 on SWE‑Verified (software‑engineering), and 91.6 on LiveCodeBench (coding on unseen problems) [1].
Qwen 3.7 Max is built for broad compatibility, supporting OpenAI‑style API formats and agent frameworks such as Claude Code and OpenClaw, and it natively handles the Anthropic API protocol [2]. Its massive 1‑million‑token context window and 64K output limit give developers ample space for large codebases and lengthy technical documents [2].
Access is provided through Alibaba Cloud Model Studio, with token pricing of $2.50 per million input tokens and $7.50 per million output tokens, plus a $10 fee per 1,000 integrated web‑search calls; code‑interpreter tools are free for a limited period [2]. This places Qwen 3.7 Max at roughly half the cost of comparable Western models such as OpenAI’s GPT‑5.4 or Anthropic’s Claude Opus 4.7, though its availability only from Chinese endpoints may deter enterprises bound by strict data‑locality rules [2].
Coverage is mostly measured — 25 of 26 reports stay neutral.
Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.
Free · 3-min read · one-click unsubscribe
AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 2 outlets · Jun 2, 2026 ·
Qwen is a trending topic in the news. Recent coverage of Qwen includes: Unified Embodied AI with Qwen-VLA - StartupHub.
10 news sources analyzed
Based on our analysis of recent news articles, Qwen has mixed coverage. Check the sentiment score above for detailed analysis.
TrendWatcher aggregates Qwen news from 100+ trusted sources and provides AI-powered sentiment analysis updated in real-time.
Qwen 3.7 Max signals Alibaba’s serious entry into the “agent era,” where AI models are expected to plan, execute, and self‑correct over extended periods rather than produce single‑turn outputs. Its ability to sustain thousands of tool calls and run for dozens of hours opens new possibilities for automated software pipelines, multi‑system customer‑service workflows, and complex financial reporting tasks that currently require human oversight.
The model’s competitive benchmark scores and lower pricing could attract developers seeking high‑performance agents without the premium of Western providers. However, the reliance on Chinese cloud endpoints may limit adoption among firms that must comply with data‑sovereignty regulations, potentially shaping the geographic split of AI‑agent deployments in the coming months.