Gemma 4 (26B + 31B) from @GoogleDeepMind is now available on the Fireworks Training Platform across the Managed and Training API workflows. Try SFT and DPO with smart defaults or your own custom loss function with a 256K context window. RL support landing soon! What would you like to see next? https://t.co/rqSamw3I3e
Fireworks AI Adds Gemma 4 Training to Build Custom Reasoning Agents
Fireworks AIFireworks AI integrated Google's Gemma 4 models into its training platform, enabling full-parameter fine-tuning and DPO with a 256K context window. This allows teams to build specialized reasoning agents on a unified stack that transitions from training to production inference in seconds.
- Model variants
- 26B and 31B
- Context window
- 256K tokens
- Training methods
- SFT, DPO, Full-parameter
- Max model scale
- 1T parameters
- Numerical parity
- < 0.01 KL divergence
- Pricing
- Per token or GPU-hour
This integration follows Fireworks AI's Day-0 support for Kimi K2.6 and mirrors the shift toward owning specialized open-weight models. By providing infrastructure for training trillion-parameter models, Fireworks removes the "migration tax" where models behave differently in training than in production. Gemma 4's 256K context window suits complex agentic tasks.
You can start training via the Training Agent for automated pipelines or the Training API for custom loss functions. The platform ensures training-inference parity, so evaluated checkpoints match production performance exactly. Managed training is priced per token; the Training API uses predictable per-GPU-hour pricing.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →




