Kimi K2.6 from @Kimi_Moonshot is now available on @FireworksAI_HQ Training Platform across the Managed and Training API workflows. Try SFT, DPO, RL with smart defaults or your own custom loss function with industry leading 265K context window. https://t.co/jqKuwWWEB0
Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents
Fireworks AIFireworks AI added Moonshot AI's Kimi K2.6 to its training platform, enabling supervised fine-tuning and reinforcement learning on the 1-trillion parameter model. This allows teams to customize the leading open-weight agentic model for specific production workflows while maintaining a 265K context window.
SFT) on the 1-trillion parameter Mixture-of-Experts (MoE) model (architecture activating only relevant sub-networks). The platform supports the model's full 265K context window.- Model
- Kimi K2.6
- Parameters
- 1 Trillion (MoE)
- Context window
- 265K tokens
- Training methods
- SFT, DPO, RL, Custom Loss
- Infrastructure
- Blackwell B200 support
- Availability
- Managed Training and Training API
This release follows Fireworks AI's Day-0 Kimi K2.6 inference support and provides the infrastructure to customize the model powering Cursor's Composer 2. By training and serving on the same hardware, teams avoid numerical drift—where model behavior changes when moving from training to production inference.
You can now use the Training API for custom loss functions or reinforcement learning (RL) at scale. Managed training handles GPU provisioning and distributed scaling for jobs from small adapters to full-parameter tuning. Access is available via the Fireworks dashboard, with pricing based on GPU-hour or token usage.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →





