Qwen-Image-2.0-Pro is now live 🚀🚀 We’ve pushed image quality, multilingual text rendering, and instruction following to a new level, while making performance much more consistent across styles.🌅🌃 Ranked #9 worldwide for Text-to-Image on @arena 🔗Try it now on ModelScope: https://t.co/pPtrbjzzBK https://t.co/raB6WWMEMP API:https://t.co/EgYS5qt2bF
Alibaba Qwen Launches Qwen-Image-2.0-Pro for Professional Infographics and 2K Design
Alibaba released Qwen-Image-2.0-Pro, a unified model that merges image generation and editing into a single 7B-parameter architecture. The update introduces 1K-token instruction support and native 2K resolution, shifting AI imagery from artistic illustration toward functional, text-heavy professional design.
- Architecture
- Unified generation and editing
- Decoder parameters
- 7B
- Encoder parameters
- 8B
- Native resolution
- 2048x2048
- Instruction limit
- 1K tokens
- Arena ranking
- #9 Text-to-Image
- Availability
- API, ModelScope
This release mirrors the shift toward functional design seen in OpenAI's ChatGPT Images 2.0. While most models struggle with long prompts, this version supports 1K-token instructions to render complex assets like multi-panel comics and infographics. It currently ranks #9 globally on the Text-to-Image Arena leaderboard.
You can use the model for workflows requiring precise typography, such as generating PPT slides or posters with structured grids. It is available via API on Alibaba Cloud's Model Studio and for testing on ModelScope. The unified architecture allows you to edit photos by adding text without switching models.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →





