Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Fireworks AI

May 27, 2026 · Updated Jun 12, 2026

Cursor and Fireworks AI shared a technical breakdown of the distributed reinforcement learning infrastructure used to build the Composer 2.5 coding model. The team treats model weights as finite storage bits dedicated entirely to software engineering, allowing the model to match frontier performance at one-tenth the cost. This shift demonstrates how specialized products can use real-world usage as a proprietary training loop.

Fireworks AI, an inference platform for fast model serving, detailed the engineering behind Composer 2.5 following the Cursor Composer 2.5 product release. Composer treats weights as finite storage bits dedicated solely to coding. This specialization, plus pipelined reinforcement learning, matches frontier performance at 10x lower cost than Claude Opus.

Pricing (input): $0.50 per million tokens
Pricing (output): $2.50 per million tokens
Weight sync speed: Under 1 minute for 1TB
Compression ratio: 20x for weight transfers
Update frequency: Every few hours

This approach solves scaling bottlenecks in distributed reinforcement learning. The team used delta compression to sync 1TB of weights across global clusters in under a minute. They also introduced "router replay" to fix numerical divergence in Mixture of Experts models, ensuring training and inference workers activate the same experts.

Cursor now uses real-time reinforcement learning to ship model updates every few hours. This turns the product into a proprietary training environment. Building on the Cursor Composer 2 technical report, the model is available now for users at $0.50 per million input tokens.

View the full update on open.spotify.com

Fireworks AI

@FireworksAI_HQMay 27

1/ Composer 2.5 is having a moment. Worth a look at how the team actually got here. @cursor_ai's Federico Cassano and @FireworksAI_HQ cofounder Dima Dzhulgakov discussed Training Data with @sonyatweetybird. The whole episode is worth your time, but we’ll break it down here.

5115

View on X

Still wondering? A few quick answers below.

Composer 2.5 is a specialized agentic coding model developed by Cursor that autonomously writes, tests, and iterates on software across complex codebases. Unlike general-purpose models, it is trained specifically for engineering tasks. This specialization allows it to match the performance of frontier models like Claude Opus while operating at one-tenth the cost.

Cursor uses a top-down training approach that combines mid-training on code with large-scale reinforcement learning. They implement pipelined reinforcement learning, which allows training and data collection to happen simultaneously. This method maximizes GPU utilization and enables the team to ship updated versions of the model to users every few hours based on real-world usage.

The model is designed to be significantly more cost-effective than general-purpose frontier models. It is currently priced at $0.50 per million input tokens and $2.50 per million output tokens. This lower price point is made possible by dedicating the model's finite weight capacity entirely to software engineering tasks rather than general-world knowledge.

Cursor uses a custom infrastructure built on Fireworks AI to sync 1TB of model weights across global clusters in under a minute. They use a lossless delta compression scheme to shrink data transfers by 20x. They also use a technique called router replay to prevent numerical errors that can cause training to fail in distributed environments.

Cursor specializes its models because it views model weights as a finite storage drive with limited bits. By intentionally excluding general-world information and focusing all capacity on coding, the model becomes more intelligent and efficient at specific engineering tasks. This specialization creates a proprietary moat by turning actual product usage into a continuous training loop.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Fireworks AI →

Keep reading

Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Fireworks AI revealed the infrastructure behind Cursor's Composer 2, using disaggregated sampling to run RL across multiple global clusters. By shipping only 2% of model weights as compressed deltas, they eliminated the need for a single massive mega-cluster. This shift makes frontier-scale RL training economically viable using fragmented, multi-region GPU capacity.

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

CursorMay 18

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Cursor released Composer 2.5, a coding model optimized for sustained performance on complex, multi-step engineering tasks. The update introduces a new reinforcement learning method that provides localized feedback during long trajectories to reduce errors in tool use and communication.

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

KimiMar 21

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Moonshot AI confirmed Kimi-k2.5 is the foundation model behind Cursor Composer 2. Cursor applied continued pretraining and high-compute RL training on top of it, with inference hosted via FireworksAI under an authorized commercial partnership.

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

OpenAIMar 15

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor published CursorBench, its internal eval suite that scores models on real coding agent tasks from actual developer sessions. Public benchmarks struggle to differentiate frontier models reliably — CursorBench produces more separation where it matters most.

What is Cursor Composer 2.5?

How does Cursor train its coding models more efficiently?

What is the pricing for Cursor Composer 2.5?

How does Cursor handle distributed reinforcement learning at scale?

Why does Cursor specialize its models for software engineering?

Keep reading

Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Keep reading

Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Cursor Releases Composer 2.5 to Improve Reliability on Long Running Coding Tasks

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology

Cursor Publishes CursorBench, Its Internal Agentic Coding Evaluation Methodology