Most teams can pick frontier models. Fewer can run them at production scale without hitting constraints in latency, throughput, and governance. Fireworks AI on @Azure AI Foundry provides the inference layer for that environment. Learn more: https://t.co/Ym0YrQ5Pmi
Fireworks AI Expands Azure Foundry Catalog With Frontier Reasoning Models
Fireworks AI · Updated
Fireworks AI added DeepSeek V4 Pro and Kimi K2.6 to Microsoft Azure AI Foundry while expanding provisioned throughput support to the US Data Zone. The update allows enterprise teams to run high-performance open models with guaranteed throughput and data residency within their existing Azure environment.
- Kimi K2.6 input: $0.95 per 1M tokens
- Kimi K2.6 output: $4.00 per 1M tokens
- DeepSeek V4 Pro input: $1.75 per 1M tokens
- DeepSeek V4 Pro output: $3.48 per 1M tokens
- Availability: Microsoft Azure AI Foundry
- Infrastructure: Provisioned Throughput Units (PTUs)
Scaling open models often requires re-architecting for a new provider or compromising on governance. Because these models run inside the Azure control plane, organizations can deploy them within their existing Azure environment instead of putting a new provider through a separate security review. The move builds on Fireworks AI's 15 trillion token milestone and targets throughput and data residency, the primary bottlenecks for enterprise deployment.
You can now deploy DeepSeek V4 Pro or Kimi K2.6 through a single Azure endpoint. Serverless pricing for Kimi K2.6 starts at $0.95 per million input tokens, while DeepSeek V4 Pro starts at $1.75 per million input tokens. For production workloads, you can request PTU capacity for consistent throughput with US data residency.
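To compare the two models on serverless pricing, the per-token rates quoted in this update can be turned into a rough monthly estimate. The sketch below is illustrative only: the function name `estimate_cost` and the example workload (200M input tokens, 50M output tokens) are assumptions, not figures from Fireworks AI or Microsoft, and it ignores PTU pricing, which this update does not state.

```python
from decimal import Decimal

# Serverless prices in USD per 1M tokens, as quoted in this update.
PRICES_PER_MILLION = {
    "Kimi K2.6": {"input": Decimal("0.95"), "output": Decimal("4.00")},
    "DeepSeek V4 Pro": {"input": Decimal("1.75"), "output": Decimal("3.48")},
}


def estimate_cost(model, input_tokens, output_tokens):
    """Return the estimated serverless cost in USD for a given token volume."""
    price = PRICES_PER_MILLION[model]
    million = Decimal(1_000_000)
    total = price["input"] * input_tokens + price["output"] * output_tokens
    return total / million


if __name__ == "__main__":
    # Hypothetical workload: 200M input tokens and 50M output tokens per month.
    for model in PRICES_PER_MILLION:
        cost = estimate_cost(model, 200_000_000, 50_000_000)
        print(f"{model}: ${cost:.2f}")
```

For this assumed input-heavy mix, the estimate comes to $390.00 for Kimi K2.6 and $524.00 for DeepSeek V4 Pro. At the quoted rates, DeepSeek V4 Pro only becomes the cheaper option when output tokens exceed roughly 1.54 times input tokens, so the balance of input to output in your workload matters more than either headline price.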
