Fireworks AI Powers Cursor Composer 2 With Distributed Global RL Infrastructure

Fireworks AI

Mar 28, 2026 · Updated Apr 25, 2026

Fireworks AI revealed the infrastructure behind Cursor's Composer 2, using disaggregated sampling to run RL across multiple global clusters. By shipping only 2% of model weights as compressed deltas, they eliminated the need for a single massive mega-cluster. This shift makes frontier-scale RL training economically viable using fragmented, multi-region GPU capacity.

Fireworks AI introduced a disaggregated sampling architecture that exploits weight sparsity in Reinforcement Learning. Between consecutive checkpoints, over 98% of weights in bf16 remain bit-equivalent. Instead of transferring a full 1TB model, the system sends a 20GB compressed delta, reducing cross-region traffic by 98% while maintaining exact reconstruction.

This approach challenges the assumption that frontier RL requires a single, co-located mega-cluster. By making policy updates small, teams can use fragmented GPU capacity across different regions. Cursor used this to train Composer 2 across four global clusters, turning distributed inference into a unified pool for generating training data.

You can implement this via the Fireworks Training SDK, which supports fully managed RL or a "bring your own trainer" model. The platform provides OpenAI-compatible sampling endpoints and a weight update API. These tools bound policy staleness to a few minutes and keep in-memory GPU swaps under 60 seconds.

View the full update on fireworks.ai

Fireworks AI

@FireworksAI_HQMar 24

We’re seeing lots of interest in how Cursor delivered Composer 2. One less obvious insight: you don't need to spend billions on a giant cluster to do reinforcement learning. With disaggregated sampling, we ran @Cursor_ai Composer 2 training across 3-4 clusters worldwide, with a unified capacity of Fireworks Virtual Cloud. Check how we optimize cross-region 1TB+ model updates by 98%+ while keeping staleness under a few minutes: https://t.co/0Ziv6ssFNx

27329

View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Fireworks AI →

Keep reading

Cursor and Fireworks AI Detail the Specialized Training Infrastructure Behind Composer 2.5

Cursor and Fireworks AI shared a technical breakdown of the distributed reinforcement learning infrastructure used to build the Composer 2.5 coding model. The team treats model weights as finite storage bits dedicated entirely to software engineering, allowing the model to match frontier performance at one-tenth the cost. This shift demonstrates how specialized products can use real-world usage as a proprietary training loop.

Cursor Releases Composer 2 Technical Report on Coding Agent Training

CursorMar 26

Cursor Releases Composer 2 Technical Report on Coding Agent Training

Cursor published a technical report on Composer 2, a coding agent trained via pretraining on Kimi K2.5 and RL on real engineering tasks. It scores 61.3 on CursorBench — 37% above Composer 1.5 — matching frontier models at lower cost.

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

KimiMar 21

Kimi-k2.5 Powers Cursor Composer 2 via Commercial Open Model Partnership

Moonshot AI confirmed Kimi-k2.5 is the foundation model behind Cursor Composer 2. Cursor applied continued pretraining and high-compute RL training on top of it, with inference hosted via FireworksAI under an authorized commercial partnership.