Fireworks AI Adds Qwen 3.5 Training to Build Custom Reasoning Agents

Fireworks AI

Apr 30, 2026

Fireworks AI integrated Alibaba's Qwen 3.5 into its training platform, supporting full-parameter fine-tuning and reinforcement learning with a 256K context window. This allows developers to customize the high-performance open-weight model for specialized reasoning and coding tasks on a unified stack.

Fireworks AI, an inference platform for fast model serving and compound AI systems, added Qwen 3.5 to its training platform via Managed and Training API workflows. The update supports supervised fine-tuning (adapting a model to specific instructions) and reinforcement learning (training via feedback loops) while maintaining a 256K context window.

Context window: 256K tokens
Training methods: SFT, DPO, RL
Fine-tuning types: LoRA, Full-parameter
Access: Managed UI, Training API
Customization: Custom loss functions, smart defaults

This follows a rapid expansion of the Fireworks AI training platform. Matching Alibaba's Qwen 3.5 release, Fireworks enables teams to build proprietary reasoning models that rival closed-source systems without the drift of fragmented training and inference stacks, while also building on Fireworks AI's safe tokenization to secure model boundaries.

You can now run SFT, DPO, or RL jobs using smart defaults or custom loss functions. The platform supports both LoRA (efficient parameter-efficient tuning) and full-parameter fine-tuning for advanced tasks. These workflows are available now through the Fireworks dashboard or Training API for the Qwen 3.5 model family.

View the full update on fireworks.ai

Fireworks AI

@FireworksAI_HQApr 29

Qwen 3.5 from @Alibaba_Qwen is now available on @FireworksAI_HQ Training Platform across the Managed and Training API workflows. Try SFT, DPO, RL with smart defaults or your own custom loss function with a 256K context window. We support Lora as well as full param fine tuning for your most advanced tasks! What would you like to see next? https://t.co/rqSamw3I3e

127

View on X

Still wondering? A few quick answers below.

Fireworks AI supports several training methods for the Qwen 3.5 model family, including supervised fine-tuning, direct preference optimization, and reinforcement learning. Users can choose between using smart defaults provided by the platform or implementing their own custom loss functions to optimize the model for specific reasoning, coding, or mathematical tasks.

Yes, the Fireworks AI training platform supports both full-parameter fine-tuning and LoRA, which is a more memory-efficient method called Low-Rank Adaptation. Full-parameter tuning allows for deep customization of the model weights for advanced tasks, while LoRA provides a faster and less resource-intensive way to adapt the model to new data.

When training Qwen 3.5 on the Fireworks AI platform, users can utilize a context window of up to 256K tokens. This large window allows the model to process and learn from extensive datasets, such as long documents or complex codebases, without losing the ability to understand long-range dependencies during the fine-tuning or reinforcement learning process.

Qwen 3.5 training is available through two primary workflows on the Fireworks AI platform. Developers can use the Managed Training interface for a guided experience or integrate the Training API directly into their own development pipelines. These workflows are designed to help teams move from training to production inference on a single unified stack.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Fireworks AI →

Keep reading

Fireworks AI Adds Managed Fine-Tuning for Qwen 3.6 27B

Fireworks AI launched managed fine-tuning for Alibaba's Qwen 3.6 27B model, supporting 256K context windows and out-of-the-box DPO. This allows developers to specialize a high-performance dense model for complex coding and reasoning tasks on a production-ready stack.

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

QwenMay 1

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

Alibaba's Qwen team partnered with Fireworks AI to provide production-ready access to its closed-weights Qwen 3.6 Plus model. This move gives global developers a low-latency, cost-effective way to run Alibaba's flagship intelligence without using Chinese cloud infrastructure.

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

VercelMay 21

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

Vercel added Alibaba's Qwen 3.7 Max to its AI Gateway, enabling developers to access the agent-focused model without separate provider accounts. The model is optimized for long-horizon execution, allowing it to maintain reasoning across complex, multi-step tasks like multi-file engineering and office automation.

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent

Nous ResearchMay 27

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent

Nous Research integrated Alibaba's Qwen 3.7 Max into its open-source Hermes Agent platform. This allows users to power autonomous multi-step workflows with reasoning models while benefiting from recent cost-saving context caching.

What training methods does Fireworks AI support for Qwen 3.5?

Can I perform full-parameter fine-tuning on Qwen 3.5 with Fireworks AI?

What is the context window for Qwen 3.5 training on Fireworks AI?

How can I access the Qwen 3.5 training workflows on Fireworks AI?

Keep reading

Fireworks AI Adds Managed Fine-Tuning for Qwen 3.6 27B

Fireworks AI Adds Managed Fine-Tuning for Qwen 3.6 27B

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent

Keep reading

Fireworks AI Adds Managed Fine-Tuning for Qwen 3.6 27B

Fireworks AI Adds Managed Fine-Tuning for Qwen 3.6 27B

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

Qwen Partners with Fireworks AI for Global Access to Qwen 3.6 Plus

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

Vercel Integrates Qwen 3.7 Max to Power Autonomous Multi Step Agent Workflows

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent

Nous Research Adds Qwen 3.7 Max Support to Hermes Agent