HeadsUpAI

Google Launches Gemini 3.5 Flash with Frontier Performance for Agentic Coding

Google launched Gemini 3.5 Flash globally as the strongest agentic and coding model in the Gemini family. The release delivers frontier-level performance at 4x the speed of comparable competitor models. The gemini-3.5-flash model ID supports a 1M token context window and 65K output tokens.
Speed
4x comparable competitor models
Pricing
$1.50 input / $9.00 output per 1M tokens (3x previous Gemini 3 Flash)
Model ID
gemini-3.5-flash (1M context, 65K output)
Benchmark
Outperforms Gemini 3.1 Pro on coding and agentic tasks
Next release
Gemini 3.5 Pro shipping next month

The model outperforms the previous Gemini 3.1 Pro on coding and agentic benchmarks, shifting Pro-tier reasoning into Flash-series latency. A new thinking_level parameter (minimal/low/medium/high) replaces sampling controls, while thought preservation carries reasoning across multi-turn conversations.

Access it via the Gemini app, AI Mode in Google Search, Gemini API, Google Antigravity, Android Studio, and Gemini Enterprise Agent Platform. Standard tier pricing is $1.50 input and $9.00 output per million tokens — approximately 3x the cost of the previous Gemini 3 Flash. Gemini 3.5 Pro ships next month — matching GitHub Copilot's integration and OpenRouter's rollout.

Google
Google
@Google
X

Meet Gemini 3.5 Flash — our strongest agentic and coding model yet. It delivers frontier-level performance at 4x the speed of comparable frontier models — often at less than half the cost. Generally available, starting today. 🧵 #GoogleIO

915retweets9.3klikes
View on X

Still wondering? A few quick answers below.

Gemini 3.5 Flash is Google's strongest agentic and coding model, now generally available. It delivers frontier-level performance at four times the speed of comparable competitor frontier models. The model is optimized for autonomous workflows where an AI reasons and executes tools independently across multiple steps.

Gemini 3.5 Flash outperforms the previous-generation Gemini 3.1 Pro on coding and agentic benchmarks, despite being a Flash-series model focused on speed. This represents a shift in efficiency where Flash-series latency now delivers Pro-tier reasoning, eliminating the historical trade-off between speed and capability across Gemini generations.

On the standard paid tier, Gemini 3.5 Flash costs $1.50 per million input tokens and $9.00 per million output tokens — approximately 3x the price of the previous Gemini 3 Flash Preview. A free tier is also available alongside Batch, Flex, and Priority tiers for different workload patterns. See Google's pricing page for full details.

Consumers access it through the Gemini app and AI Mode in Google Search. Developers can use the Gemini API in Google AI Studio, Google Antigravity, and Android Studio. Enterprise customers access it through the dedicated Gemini Enterprise Agent Platform with controls for production deployment.

Google confirmed Gemini 3.5 Pro ships next month. While Gemini 3.5 Flash is now generally available across consumer and developer platforms, the Pro version will follow with higher reasoning depth for the most demanding workflows. The full Gemini 3.5 family rollout continues throughout the year.

Share this update