Qwen 3.5 Medium Models Match Flagship Performance With a Fraction of Parameters

Qwen

Feb 24, 2026 · Updated Apr 25, 2026

Alibaba's Qwen team released four medium-sized models that match their previous flagship while activating a fraction of the parameters. The standout Qwen3.5-35B-A3B uses just 3 billion active parameters yet surpasses Qwen3-235B-A22B across reasoning, coding, and agentic benchmarks.

Qwen released the Qwen 3.5 Medium Model Series - four models spanning Qwen3.5-Flash, Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. The headline result is the 35B-A3B: a sparse mixture-of-experts model that activates just 3 billion parameters per forward pass, yet matches or exceeds the previous 235B flagship across SWE-bench Verified, GPQA Diamond, and agentic tool-use benchmarks.

The architecture combines Gated Delta Networks for linear attention with sparse MoE layers, delivering high throughput at a fraction of the inference cost. All models support native vision-language understanding through early fusion training and 201 languages out of the box. Qwen3.5-Flash, the hosted production version, extends context to 1 million tokens.

Weights for the open models are available on Hugging Face, compatible with vLLM, SGLang, and Transformers.

View the full update on huggingface.co

Qwen

@Alibaba_QwenFeb 24

🚀 Introducing the Qwen 3.5 Medium Model Series Qwen3.5-Flash · Qwen3.5-35B-A3B · Qwen3.5-122B-A10B · Qwen3.5-27B ✨ More intelligence, less compute. • Qwen3.5-35B-A3B now surpasses Qwen3-235B-A22B-2507 and Qwen3-VL-235B-A22B — a reminder that better architecture, data quality, https://t.co/ZWPibMn6at

1.1k

View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Qwen →

Keep reading

Alibaba Open Sources Qwen3.6-35B-A3B Rivaling Frontier Models With 3B Active Parameters

Alibaba open-sourced Qwen3.6-35B-A3B, a sparse Mixture-of-Experts model with 35 billion total parameters that only activates 3 billion during inference. Despite its small active size, it matches frontier models in agentic coding and multimodal reasoning while operating under a permissive Apache 2.0 license.

Fireworks AI Adds Qwen 3.5 Training to Build Custom Reasoning Agents

Fireworks AIApr 30

Fireworks AI Adds Qwen 3.5 Training to Build Custom Reasoning Agents

Fireworks AI integrated Alibaba's Qwen 3.5 into its training platform, supporting full-parameter fine-tuning and reinforcement learning with a 256K context window. This allows developers to customize the high-performance open-weight model for specialized reasoning and coding tasks on a unified stack.

Alibaba Qwen3.7 Preview Enters Arena Top 15 for Text and Vision

ArenaMay 18

Alibaba Qwen3.7 Preview Enters Arena Top 15 for Text and Vision

Alibaba's Qwen3.7 Max and Plus preview models have debuted on the Arena.ai leaderboards, ranking #13 in text and #16 in vision. The results establish Alibaba as a top-six global AI lab with specific strengths in math, software engineering, and expert-level reasoning.

OpenRouter Adds Qwen3.7-Max for Long Horizon Agentic Coding and Office Tasks

OpenRouterMay 21

OpenRouter Adds Qwen3.7-Max for Long Horizon Agentic Coding and Office Tasks

OpenRouter integrated Alibaba's Qwen3.7-Max, a flagship model optimized for autonomous agent loops and multi-hour task execution. The update introduces explicit prompt caching for the Qwen series, allowing developers to maintain massive context windows at a 90 percent discount on subsequent requests.