Fireworks AI Launches Full Parameter RL Training for Kimi K2.6

Fireworks AI

May 13, 2026 · Updated Jun 12, 2026

Fireworks AI added full-parameter reinforcement learning support for Moonshot AI's 1-trillion parameter Kimi K2.6 model. This allows developers to tune the entire model weight set on proprietary data to build specialized agentic moats that outperform off-the-shelf frontier systems.

Fireworks AI, an inference platform for fast model serving and training, launched full-parameter reinforcement learning (RL) for Kimi K2.6. While Fireworks AI's Day-0 Kimi K2.6 support focused on inference, this update enables tuning the entire 1-trillion parameter set rather than relying on adapters.

Model: Kimi K2.6
Total parameters: 1 trillion
Active parameters: 32 billion (MoE)
Context window: 256K tokens
Availability: Private preview

This shift allows teams to build proprietary data moats by owning the model's core behavior. By training on an open-weight base with specialized data, companies can create models that outperform generic frontier APIs. The platform utilizes Fireworks AI's delta-compressed weight updates to sync training and inference clusters across fragmented GPU capacity.

You can implement custom loss functions and rewards in Python while Fireworks manages the distributed GPU infrastructure and FSDP. The Training API is in private preview, supporting the model's native 256K context window for long-horizon agentic tasks. Access is available by request through the Fireworks contact portal.

View the full update on docs.fireworks.ai

Fireworks AI

@FireworksAI_HQMay 12

𝐅𝐮𝐥𝐥-𝐏𝐚𝐫𝐚𝐦 𝐑𝐋 𝐧𝐨𝐰 𝐚𝐯𝐚𝐢𝐥𝐚𝐛𝐥𝐞 𝐟𝐨𝐫 𝐊𝐢𝐦𝐢 𝐊𝟐.𝟔 You've been told only 3 AI labs matter. The best AI apps never believed that. @cursor_ai, @vercel, @genspark_ai don't run only off-the-shelf models. They train on open-source bases with their own data and run continuous RL to pull ahead. LoRA gets you in the door. Full-param RL is true model ownership for the maximum data moat. Today, Kimi K2.6 full param tuning is now available on Fireworks Training. 256K context. Train the whole thing. Ready to get started? https://t.co/due6j5oNBl

572

View on X

Still wondering? A few quick answers below.

It is a training method that allows developers to update all weights of the 1-trillion parameter Kimi K2.6 model simultaneously. Unlike LoRA, which uses smaller adapters, full-parameter tuning enables deeper model ownership and the creation of proprietary data moats by optimizing the entire model for specific agentic or coding tasks.

The API uses a service-mode architecture where you write training logic, such as custom loss and reward functions, in plain Python on your local machine. Fireworks handles the heavy lifting, including GPU provisioning, distributed forward and backward passes, and sharding model parameters across chips using Fully Sharded Data Parallel techniques.

The Training API for Kimi K2.6 is currently in private preview. Interested developers and organizations must request early access through the Fireworks AI website to begin using the platform. Once granted access, users can leverage the 256K context window to build specialized models for long-horizon agentic workflows.

Fireworks supports the full 256K token context window for Kimi K2.6 during training. This allows the model to process and learn from massive datasets, which is essential for long-horizon tasks like autonomous coding or complex document analysis where maintaining coherence over long sequences of information is a primary requirement.

Fireworks uses a distributed architecture that employs delta-compressed weight updates to synchronize training and inference clusters. By only shipping the small percentage of weights that change between checkpoints, the platform reduces the bandwidth and compute costs typically associated with training frontier-scale models like the 1-trillion parameter Kimi K2.6.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Fireworks AI →

Keep reading

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI added Moonshot AI's Kimi K2.6 to its training platform, enabling supervised fine-tuning and reinforcement learning on the 1-trillion parameter model. This allows teams to customize the leading open-weight agentic model for specific production workflows while maintaining a 265K context window.

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

KimiApr 24

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Moonshot AI released Kimi K2.6, an open-source model that achieves state-of-the-art scores on SWE-Bench Pro and Toolathlon. The update introduces long-horizon coding, enabling agents to execute over 4,000 autonomous steps without losing context or drifting from the task.

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

CloudflareMar 20

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare's Workers AI now supports Kimi K2.5, Moonshot AI's frontier open-source model with a 256k context window. Developers can build and run full agent workflows on Cloudflare's platform, with prefix caching and a new async API cutting inference costs.

What is full-parameter reinforcement learning for Kimi K2.6?

How does the Fireworks Training API work?

Is Kimi K2.6 full-parameter training available to everyone?

What is the context window for Kimi K2.6 training on Fireworks?

How does Fireworks reduce the cost of frontier RL training?

Keep reading

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Keep reading

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows