GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality: - Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4 - Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6 - Text Arena: #7, Math #3, Instruction Following: #8 - Expert Arena: #5 - Search Arena: #2 - Vision Arena: #5 Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.
Arena.ai Ranks GPT-5.5 as Top Tier for Search and Coding
Arena· Updated
GPT-5.5 entered the Arena.ai leaderboards with a top-two ranking in search and a 50-point performance jump in agentic web development. These community-driven results validate the model's focus on complex tool use and reasoning across vision, math, and document analysis.
- Code Arena Rank
- #9 (+50 pts vs GPT-5.4)
- Search Arena Rank
- #2
- Math Rank
- #3
- Expert Arena Rank
- #5
- Vision Arena Rank
- #5 (#1 for Diagrams)
- Document Arena Rank
- #6
- Reasoning Effort Evaluated
- Medium and High
- Availability
- ChatGPT and Codex API
These results provide objective validation, following a pattern seen in the GPT-5.5 launch, which OpenAI positioned as a new class of intelligence for agentic work. While it currently trails, mirroring Alibaba's Qwen3.6 Plus, the point increase suggests a major shift in how the model handles multi-step goals.
Use these rankings to decide which modality—such as the #1 ranked diagram analysis or #6 ranked document reasoning—best fits your workflow. Current scores reflect "medium" and "high" reasoning effort levels, with an xHigh evaluation pending. GPT-5.5 is available via ChatGPT and the Codex API.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →



