Today weβre releasing Qwen-Scope π, an open suite of sparse autoencoders for the Qwen model family. It turns SAE features into practical toolsοΌ π― Inference β Steer model outputs by directly manipulating internal features, no prompt engineering needed π Data β Classify & synthesize targeted data with minimal seed examples, boosting long-tail capabilities ποΈ Training β Trace code-switching & repetitive generation back to their source, fix them at the root π Evaluation β Analyze feature activation patterns to select smarter benchmarks and cut redundancy We hope the community uses Qwen-Scope to uncover new mechanisms inside Qwen models and build applications beyond what we explored.Excited to see what you build! π ππ Blog: https://t.co/ndwiE1tnb9 HuggingFace: https://t.co/1kICpK8eXG ModelScope: https://t.co/U7v1FjmPaW Technical Report: https://t.co/CZMjEZK0sa
Alibaba Qwen Releases Qwen-Scope to Turn Neural Features Into Practical Development Tools
Β· Updated
- Models covered
- 7
- SAE sets released
- 14
- Training data
- 0.5B tokens per model
- Data synthesis efficiency
- 15x improvement
- Architecture support
- Dense and Mixture of Experts
- Availability
- Hugging Face and ModelScope
This update shifts interpretability from research into a functional engineering layer. While Anthropic's neural mapping research identified character archetypes, Qwen-Scope provides a suite for active model optimization. It enables developers to fix repetitive generation by suppressing the specific neural activations responsible for those behaviors during fine-tuning (adapting a model to specific tasks).
You can use these features to steer model outputs without prompt engineering or to synthesize training data 15 times more efficiently. The toolkit also helps identify benchmark redundancy by analyzing overlapping neural patterns. The SAE weights are available now on Hugging Face for the Qwen3.5 small models.
Still wondering? A few quick answers below.





