Opus 4.8 is live on OpenRouter! Same price as 4.7 with gains across agentic coding, reasoning, and computer use. Around 4x less likely than 4.7 to let code flaws pass unremarked. Opus 4.8 Fast Mode is also live - now only 2x the cost for 2.5x the speed. https://t.co/vdSGB1BBKR
OpenRouter Launches Claude Opus 4.8 With Aggressive Fast Mode Pricing
OpenRouter, a unified API platform for accessing hundreds of LLMs, launched Anthropic's Claude Opus 4.8 alongside a high-speed Fast Mode variant. The new model improves on reasoning, computer use, and agentic coding, claiming a 4x reduction in overlooked code flaws.
- Model
- Claude Opus 4.8
- Speed (Fast Mode)
- 2.5x faster
- Pricing (Fast Mode)
- 2x standard cost
- Code flaw detection
- 4x improvement vs 4.7
- New feature
- Mid-conversation system messages
This release shifts the economics of high-performance inference. While previous Opus 4.7 Fast Mode implementations required a 6x price premium, OpenRouter has slashed the cost of Opus 4.8 Fast Mode to just 2x. This makes low-latency frontier reasoning viable for production agentic loops.
The release also enables mid-conversation system messages. You can access Opus 4.8 via the OpenRouter API immediately at the same price as 4.7, while Fast Mode delivers 2.5x speed for double the cost. These updates are particularly relevant for developers using Claude Code to maintain fluid workflows.
OpenRouter
@OpenRouter
10retweets365likes
View on XStill wondering? A few quick answers below.
Claude Opus 4.8 is the latest iteration of Anthropic's flagship reasoning model available through the OpenRouter API. This version introduces performance improvements in reasoning, computer use, and agentic coding. It is specifically designed to be more reliable for developers, as it is four times less likely to overlook code flaws compared to the previous version.
The standard version of Claude Opus 4.8 is priced identically to the previous 4.7 version on OpenRouter. For users requiring higher performance, the platform also offers a Fast Mode variant. While previous high-speed tiers carried a 6x price premium, the new Opus 4.8 Fast Mode is priced at only twice the cost of the standard model.
Fast Mode is a high-speed inference tier for the Claude Opus 4.8 model that delivers 2.5x faster response speeds compared to the standard version. This mode is optimized for latency-sensitive tasks like interactive coding or autonomous agent loops where execution speed is critical, and it is now available at a reduced 2x price premium.
Yes, the integration of Claude Opus 4.8 on OpenRouter includes support for injecting system messages in the middle of a conversation. This technical capability allows developers to provide new instructions or constraints to the model as a dialogue progresses, offering finer control over the model's behavior and reasoning during complex, multi-step interactions.
Claude Opus 4.8 improves agentic coding by significantly increasing the model's ability to detect and remark on flaws within a codebase. According to the release data, the model is approximately four times less likely than version 4.7 to let code errors pass without notice, making it a more effective tool for autonomous software development and debugging.




