Gemini 3.5 Flash from @GoogleDeepMind is live on OpenRouter! Beats Gemini 3.1 Pro on coding, agentic work, and tool use at Flash-tier price and speed. 1M context, 65K max output, multimodal. $1.50/M input, $9/M output. https://t.co/VkTuvWz2ey
OpenRouter Adds Gemini 3.5 Flash for High Performance Agentic Coding
- Context window
- 1,048,576 tokens
- Max output
- 65,536 tokens
- Pricing (input)
- $1.50 per million tokens
- Pricing (output)
- $9.00 per million tokens
- Thinking levels
- Minimal, Low, Medium, and more
- Input modalities
- Text, Image, Video, and more
This release shifts the performance-to-price ratio by beating the older Gemini 3.1 Pro on software engineering benchmarks, mirroring Google's Gemini 3.5 Flash launch. It follows the recent addition of Gemini 3.1 Flash Lite to the platform, but offers higher reasoning depth. Moving frontier-level coding into the Flash tier makes high-volume agentic development economically viable.
You can access the model via the OpenRouter API for $1.50 per million input tokens, matching Vercel's Gemini 3.5 Flash integration. The implementation supports fine-grained thinking levels to balance cost and intelligence, similar to Google's inference tiers. Developers can use the reasoning parameter to inspect the model's internal step-by-step logic.
Still wondering? A few quick answers below.



