Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Cursor

May 18, 2026 · Updated Jun 13, 2026

Cursor released Composer 2.5, a coding model optimized for sustained performance on complex, multi-step engineering tasks. The update introduces a new reinforcement learning method that provides localized feedback during long trajectories to reduce errors in tool use and communication.

Cursor, an AI-native code editor, released Composer 2.5 to improve reliability on long-horizon agentic tasks. While still based on the Kimi-k2.5 foundation, this version uses targeted reinforcement learning (a training method that aligns model behavior with feedback) to correct specific mistakes—like bad tool calls—within long sequences.

Pricing (input): $0.50 per million tokens
Pricing (output): $2.50 per million tokens
Fast tier (input): $3.00 per million tokens
Fast tier (output): $15.00 per million tokens
Training compute (future): 10x increase via Colossus 2
Synthetic data scale: 25x more tasks than Composer 2

This release addresses the credit assignment problem, where a single reward at the end of a long task makes it hard for models to learn from specific mid-process errors. By scaling synthetic data 25x, Cursor is targeting the multi-day autonomous engineering sessions recently teased for OpenAI's Codex.

You can use Composer 2.5 now within the Cursor editor, with usage limits doubled for the first week. Standard pricing is $0.50 per million input tokens. Looking ahead, a partnership with SpaceXAI aims to train a larger model from scratch using the massive Colossus 2 compute cluster.

View the full update on cursor.com

Cursor

@cursor_aiMay 18

Introducing Composer 2.5, our most powerful model yet. It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. For the next week, we’re doubling the included usage of the model. https://t.co/N87ojcXlOC

1.4k13k

View on X

Still wondering? A few quick answers below.

Composer 2.5 is an updated AI coding model designed for the Cursor editor that focuses on improving performance during long-running agentic tasks. It is built on the Kimi K2.5 foundation but features architectural refinements that make it more reliable at following complex, multi-step instructions and maintaining intelligence over extended work sessions.

Cursor uses targeted textual feedback to improve model behavior during long trajectories. Instead of providing a single reward at the end of a task, the system inserts specific hints at the exact point where a mistake occurred, such as an invalid tool call. This localized signal helps the model learn precise behavioral corrections without losing the broader objective.

The standard version of Composer 2.5 is priced at $0.50 per million input tokens and $2.50 per million output tokens. There is also a faster variant available that maintains the same level of intelligence for $3.00 per million input tokens and $15.00 per million output tokens, which is the default option in the editor.

The model was trained using 25x more synthetic tasks than the previous version, including techniques like feature deletion where the agent must reimplement functionality in a codebase to pass verifiable tests. This large-scale synthetic data training helps the model find sophisticated solutions and workarounds for difficult real-world engineering problems.

Cursor is collaborating with SpaceXAI to train a significantly larger frontier model from scratch using 10x more total compute than previous efforts. This training will utilize the Colossus 2 cluster, which consists of one million H100-equivalent GPUs, to achieve a major leap in model capability beyond the current Composer 2.5 release.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Cursor →

Keep reading

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor published a technical report on Composer 2, a coding agent trained via pretraining on Kimi K2.5 and RL on real engineering tasks. It scores 61.3 on CursorBench — 37% above Composer 1.5 — matching frontier models at lower cost.

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Fireworks AIMay 27

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Cursor and Fireworks AI shared a technical breakdown of the distributed reinforcement learning infrastructure used to build the Composer 2.5 coding model. The team treats model weights as finite storage bits dedicated entirely to software engineering, allowing the model to match frontier performance at one-tenth the cost. This shift demonstrates how specialized products can use real-world usage as a proprietary training loop.

xAIJun 1

xAI Adds Cursor's Composer 2.5 to Grok Build for Long-Running Agentic Tasks

xAI announced that Cursor's Composer 2.5 model is now available inside Grok Build. The integration brings a specialized coding model to X Premium+ and SuperGrok subscribers, optimized for the sustained reasoning that long-running autonomous engineering tasks demand.

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

OpenAIMar 15

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor published CursorBench, its internal eval suite that scores models on real coding agent tasks from actual developer sessions. Public benchmarks struggle to differentiate frontier models reliably — CursorBench produces more separation where it matters most.

What is Cursor Composer 2.5?

How does the targeted reinforcement learning in Composer 2.5 work?

What is the pricing for Cursor Composer 2.5?

How was Composer 2.5 trained to handle complex coding tasks?

What is the partnership between Cursor and SpaceXAI?

Keep reading

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

xAI Adds Cursor's Composer 2.5 to Grok Build for Long-Running Agentic Tasks

xAI Adds Cursor's Composer 2.5 to Grok Build for Long-Running Agentic Tasks

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Keep reading

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

xAI Adds Cursor's Composer 2.5 to Grok Build for Long-Running Agentic Tasks

xAI Adds Cursor's Composer 2.5 to Grok Build for Long-Running Agentic Tasks

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology