Local Qwen3.6 Outperforms Claude Opus 4.7 on Complex Spatial Coding Tasks

Simon Willison

Apr 17, 2026 · Updated Jun 5, 2026

Alibaba released Qwen3.6-35B-A3B, an open-weight model that outperformed Anthropic's new Claude Opus 4.7 at generating SVG code in Simon Willison's informal pelican benchmark. Despite running locally on a laptop, the quantized Qwen model correctly rendered spatial geometry that the frontier model got wrong even with maximum reasoning enabled. The result suggests that optimized local models can compete with massive proprietary systems on some specialized technical workflows.

Simon Willison, creator of Datasette, reported that a 21GB quantized build of Alibaba's Qwen3.6-35B-A3B open-weight model, running on his laptop, drew a better pelican than Anthropic's flagship Claude Opus 4.7 in his SVG generation test.

This result challenges the assumption that frontier models always provide the highest quality for technical tasks. Even when Claude Opus 4.7 used its maximum thinking budget, it failed to correctly render a bicycle frame that the smaller Qwen model handled accurately. Scale and thinking tokens do not always guarantee superior spatial reasoning.

To try this yourself, run Qwen3.6-35B-A3B locally with a tool such as LM Studio or Ollama, using GGUF weights (GGUF is a file format designed for running LLMs locally). Based on this test, the setup is worth trying for SVG illustration and technical code generation. The model is available on Hugging Face as a cost-effective alternative to proprietary APIs.
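As a rough illustration of the local workflow described above, here is a minimal Python sketch that asks a locally running Ollama server for an SVG and extracts the markup from the reply. It assumes Ollama is listening on its default address and that the model has already been pulled. The model tag `qwen3.6:35b-a3b` and the helper names are hypothetical and not taken from the original post, so substitute whatever name your local install reports.

```python
import json
import re
import urllib.request
from typing import Optional

# Assumption: Ollama's default local endpoint; change it if your server differs.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Hypothetical model tag; use the name your local install actually lists.
MODEL_TAG = "qwen3.6:35b-a3b"


def build_payload(prompt: str, model: str = MODEL_TAG) -> dict:
    """Build a non-streaming request body for Ollama's /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}


def extract_svg(text: str) -> Optional[str]:
    """Return the first <svg>...</svg> element in the model output, if any."""
    match = re.search(r"<svg\b.*?</svg>", text, flags=re.DOTALL | re.IGNORECASE)
    return match.group(0) if match else None


def generate_svg(prompt: str, url: str = OLLAMA_URL, timeout: float = 600.0) -> Optional[str]:
    """Send the prompt to the local server and pull the SVG out of the reply."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        reply = json.loads(response.read().decode("utf-8"))
    # With "stream": False, the generated text arrives in the "response" field.
    return extract_svg(reply.get("response", ""))
```

Calling `generate_svg("Generate an SVG of a pelican riding a bicycle")` and writing the result to a `.svg` file would give you something to open in a browser. The extraction step is there because models often wrap the markup in extra commentary.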

View the full update on simonwillison.net

Simon Willison

@simonw · Apr 16

Shocking result on my pelican benchmark this morning, I got a better pelican from a 21GB local Qwen3.6-35B-A3B running on my laptop than I did from the new Opus 4.7! Qwen on the left, Opus on the right https://t.co/kDlbnJv6YI

