Qwen 3.7 Plus is now live on Fireworks. You get the official weights running on our stack. That means full control of latency, throughput, and data path end-to-end, with zero data retention and our 99.9% SLA. Let’s dig in ↓ https://t.co/4JAmGyj9PE
Fireworks AI Adds Qwen 3.7 Plus With Agentic Reasoning and Caching
Fireworks AIFireworks AI now serves Qwen 3.7 Plus as a direct inference provider, offering full control over latency and data paths. The model supports thinking and non-thinking modes, preserved reasoning history, and prompt caching by default. It is available on serverless endpoints compatible with OpenAI and Anthropic APIs, priced at 0.50 dollars per million input tokens.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →




