Google Launches Gemini 3.5 Flash with Frontier Performance for Agentic Coding

Google

May 19, 2026 · Updated Jun 12, 2026

Google moved Gemini 3.5 Flash to general availability, positioning it as the strongest agentic and coding model in the Gemini family. The release delivers frontier-level performance at 4x the speed of comparable competitor models, though pricing has risen 3x versus the previous Gemini 3 Flash.

Google launched Gemini 3.5 Flash globally as the strongest agentic and coding model in the Gemini family. The release delivers frontier-level performance at 4x the speed of comparable competitor models. The gemini-3.5-flash model ID supports a 1M token context window and 65K output tokens.

Speed: 4x comparable competitor models
Pricing: $1.50 input / $9.00 output per 1M tokens (3x previous Gemini 3 Flash)
Model ID: gemini-3.5-flash (1M context, 65K output)
Benchmark: Outperforms Gemini 3.1 Pro on coding and agentic tasks
Next release: Gemini 3.5 Pro shipping next month

The model outperforms the previous Gemini 3.1 Pro on coding and agentic benchmarks, shifting Pro-tier reasoning into Flash-series latency. A new thinking_level parameter (minimal/low/medium/high) replaces sampling controls, while thought preservation carries reasoning across multi-turn conversations.

Access it via the Gemini app, AI Mode in Google Search, Gemini API, Google Antigravity, Android Studio, and Gemini Enterprise Agent Platform. Standard tier pricing is $1.50 input and $9.00 output per million tokens — approximately 3x the cost of the previous Gemini 3 Flash. Gemini 3.5 Pro ships next month — matching GitHub Copilot's integration and OpenRouter's rollout.

View the full update on ai.google.dev

Google

@GoogleMay 19

Meet Gemini 3.5 Flash — our strongest agentic and coding model yet. It delivers frontier-level performance at 4x the speed of comparable frontier models — often at less than half the cost. Generally available, starting today. 🧵 #GoogleIO

9449.5k

View on X

Still wondering? A few quick answers below.

Gemini 3.5 Flash is Google's strongest agentic and coding model, now generally available. It delivers frontier-level performance at four times the speed of comparable competitor frontier models. The model is optimized for autonomous workflows where an AI reasons and executes tools independently across multiple steps.

Gemini 3.5 Flash outperforms the previous-generation Gemini 3.1 Pro on coding and agentic benchmarks, despite being a Flash-series model focused on speed. This represents a shift in efficiency where Flash-series latency now delivers Pro-tier reasoning, eliminating the historical trade-off between speed and capability across Gemini generations.

On the standard paid tier, Gemini 3.5 Flash costs $1.50 per million input tokens and $9.00 per million output tokens — approximately 3x the price of the previous Gemini 3 Flash Preview. A free tier is also available alongside Batch, Flex, and Priority tiers for different workload patterns. See Google's pricing page for full details.

Consumers access it through the Gemini app and AI Mode in Google Search. Developers can use the Gemini API in Google AI Studio, Google Antigravity, and Android Studio. Enterprise customers access it through the dedicated Gemini Enterprise Agent Platform with controls for production deployment.

Google confirmed Gemini 3.5 Pro ships next month. While Gemini 3.5 Flash is now generally available across consumer and developer platforms, the Pro version will follow with higher reasoning depth for the most demanding workflows. The full Gemini 3.5 family rollout continues throughout the year.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Keep reading

GitHub Integrates Gemini 3.5 Flash for High-Speed Agentic Coding Workflows

GitHub moved Google's Gemini 3.5 Flash to general availability within the Copilot ecosystem, rolling it out across major IDEs and mobile apps. The integration targets agentic coding workflows where high cache efficiency and fast response times are more critical than the raw reasoning depth of larger flagship models.

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Simon WillisonMay 20

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Google released Gemini 3.5 Flash at Google I/O 2026, moving the model directly to general availability for developers and consumer products. Independent analysis reveals a threefold price increase over the previous generation, signaling a shift toward higher inference costs for frontier-level speed.

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Google AI StudioMay 22

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Gemini 3.5 Flash has ranked first on the APEX-Agents-AA benchmark, outperforming larger frontier models in autonomous task execution. The result confirms that high-speed, low-cost models are now capable of handling complex agentic workflows previously reserved for larger architectures.

Google DeepMindMar 3

Google Launches Gemini 3.1 Flash-Lite for High-Volume AI Workloads

Google DeepMind released Gemini 3.1 Flash-Lite, its fastest and most cost-efficient Gemini 3 series model, priced at $0.25/1M input tokens. It outperforms 2.5 Flash with 2.5X faster response times, making high-volume AI tasks significantly cheaper and quicker to run.

What is Gemini 3.5 Flash?

How does Gemini 3.5 Flash compare to Gemini 3.1 Pro?

How much does Gemini 3.5 Flash cost?

Where can I access Gemini 3.5 Flash?

When will Gemini 3.5 Pro be released?

Keep reading

GitHub Integrates Gemini 3.5 Flash for High-Speed Agentic Coding Workflows

GitHub Integrates Gemini 3.5 Flash for High-Speed Agentic Coding Workflows

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Google Launches Gemini 3.1 Flash-Lite for High-Volume AI Workloads

Google Launches Gemini 3.1 Flash-Lite for High-Volume AI Workloads

Keep reading

GitHub Integrates Gemini 3.5 Flash for High-Speed Agentic Coding Workflows

GitHub Integrates Gemini 3.5 Flash for High-Speed Agentic Coding Workflows

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Simon Willison Analyzes Gemini 3.5 Flash and Finds a Threefold Price Increase

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Google Launches Gemini 3.1 Flash-Lite for High-Volume AI Workloads

Google Launches Gemini 3.1 Flash-Lite for High-Volume AI Workloads