Skip to content
Free · 1,000+ readers
Free · Independent
The daily record of artificial intelligence
← Back
AI

Alibaba released Qwen3.7-Max, a proprietary agent model

The API-only model, available on Alibaba Cloud’s Model Studio with a promotional discount, integrates with agent frameworks; Alibaba reports competitive benchmark scores but independent verification remains scarce.

Monday, June 1, 2026 · min
Alibaba released Qwen3.7-Max, a proprietary agent model

Alibaba released Qwen3.7-Max in late May 2026, a proprietary large language model engineered for agentic workflows, marking a strategic shift for the Qwen team from open-weight research models toward closed, API-only frontier systems. The model is positioned for coding, debugging, office automation and long-horizon tool-use tasks, and Alibaba Cloud is pairing it with infrastructure on Model Studio, heavily discounted initial pricing and integration with external agent harnesses such as Anthropic’s Claude Code.

The launch is the most explicit step yet in Alibaba’s push to commercialize full-stack AI agents, tying model capability directly to cloud services. For enterprise buyers, it signals a bet that the next wave of AI spending will be on autonomous software engineering and business-process automation, not just conversational fluency.

Qwen3.7-Max is proprietary and API-only, accessible through Alibaba Cloud’s Model Studio. Official pricing is $2.50 per million input tokens and $7.50 per million output tokens, with a promotional 50% discount cutting those to $1.25 and $3.75 until June 22, 2026. The company’s example agent setup uses a context window of 1 million tokens and a maximum output of 65,536 tokens. Alibaba says the model can be integrated with agent frameworks including Anthropic’s Claude Code, OpenClaw and Qwen Code. On May 26, Alibaba Cloud announced availability in its Singapore region, though the initial blog post had described access as “coming soon”; broader global rollout details remain thin.

Alibaba reported benchmark scores that place Qwen3.7-Max among high-performing frontier models: 92.4 on GPQA Diamond, 80.4 on SWE-Verified (which the company said is on par with Anthropic’s Opus-4.6 Max at 80.8 and DeepSeek’s DS-V4-Pro Max at 80.6), 69.7 on Terminal-Bench 2.0-Terminus, 60.8 on MCP-Mark and 76.4 on MCP-Atlas. On Artificial Analysis’s Intelligence Index version 4.0, the model scored 56.6, which Alibaba said ranked it fifth globally and first among Chinese models. Artificial Analysis notes, however, that some sub-scores may be lab-claimed rather than independently measured, and many of the detailed benchmarks come from Alibaba’s own reporting.

Alibaba also described an internal demonstration in which Qwen3.7-Max ran autonomously for about 35 hours, executed 432 kernel evaluations, made 1,158 tool calls and achieved a 10.0x geometric mean speedup over a Triton reference on a T-Head ZW-M890 PPU. The company has not released public logs, code or hardware details to allow independent audit.

The release is part of a wider agentic-ai ecosystem push that includes skills, multi-agent orchestration and enterprise tools. In its FY2026 annual report filed with the Hong Kong exchange, Alibaba called Qwen3.7-Max the latest-generation large language model engineered for agents and said the Qwen family saw three updates in the previous three months. That rapid cadence contrasts with the slower, open-weight rollout of earlier Qwen models and mirrors an industry trend in which frontier labs gate their most capable systems behind paid APIs. Alibaba’s mention of three updates has been interpreted by some observers, including Chinese tech media outlet LeiPhone, as evidence of a 30-day flagship release cycle, but the company has not publicly committed to such a rhythm.

Several uncertainties remain. Independent parties have not yet fully reproduced the benchmark scores. The 35-hour kernel optimization demo lacks verifiable evidence. While Singapore availability was confirmed, Alibaba has not disclosed broader region coverage, data-residency policies or enterprise service-level agreements. Earlier preview versions—Qwen3.5-Max-Preview and Qwen3.6-Max-Preview, reported by secondary sources as released in March and April 2026—lack primary-source confirmation.

Qwen3.7-Max signals Alibaba’s determination to compete in enterprise AI agents across model capability, cloud infrastructure and developer tooling. But the claims of parity with GPT and Claude rest largely on vendor-reported benchmarks, and the company has not yet provided the independent evidence that would solidify the model’s standing among global leaders.

— End —