Fireworks AI Adds Qwen 3.7 Plus With Agentic Reasoning and Caching

Fireworks AIFireworks AI

Fireworks AI now serves Qwen 3.7 Plus as a direct inference provider, offering full control over latency and data paths. The model supports thinking and non-thinking modes, preserved reasoning history, and prompt caching by default. It is available on serverless endpoints compatible with OpenAI and Anthropic APIs, priced at 0.50 dollars per million input tokens.

Model performance benchmarks across coding, agentic, and multimodal tasks comparing Qwen, DeepSeek, GLM, Kimi, Claude, GPT, and Gemini.
Fireworks AI
Fireworks AI
@FireworksAI_HQ
X

Qwen 3.7 Plus is now live on Fireworks. You get the official weights running on our stack. That means full control of latency, throughput, and data path end-to-end, with zero data retention and our 99.9% SLA. Let’s dig in ↓ https://t.co/4JAmGyj9PE

2retweets50likes
View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Share this update