Google Gemini 3.5 Flash Ranks First on Zapier Automation Benchmark

Google AI Studio

May 21, 2026 · Updated Jun 12, 2026

Gemini 3.5 Flash took the top spot on Zapier's Automation Bench, outperforming all other frontier models in operations and support tasks. The result validates Google's strategy of delivering high-speed, low-cost models that maintain competitive intelligence for autonomous workflows.

Google's Gemini 3.5 Flash model ranked first on the Automation Bench from Zapier, an evaluation designed to measure performance in real-world operations and support tasks. The model outperformed every other frontier model tested, including larger flagship systems, while operating at a significantly lower inference cost.

Benchmark: Zapier Automation Bench
Ranking: 1st place
Context window: 1 million tokens
Availability: Gemini API and Google AI Studio
Primary use cases: Operations and Support

This ranking follows the Gemini 3.5 Flash launch and adds third-party evidence for Google's approach of pairing speed and low cost with competitive capability. Arena.ai placed Gemini 3.5 Flash ninth in its overall Text and Frontend Coding categories; the Zapier results instead highlight its reliability in multi-step automation, echoing its first-place finish on the APEX-Agents-AA benchmark.

You can now prioritize Gemini 3.5 Flash for high-volume automation tasks where cost and latency are critical constraints. The model is available via the Gemini API and Google AI Studio, offering a one-million-token context window for complex data mapping. Its performance in support and operations makes it a viable candidate for replacing larger, more expensive models in those workflows.
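To make the cost argument concrete, here is a minimal sketch of how monthly spend for a high-volume automation workload could be compared across two models. Everything in it is an assumption for illustration: the `ModelPricing` class, the `monthly_cost` helper, the model names, and above all the prices, which are placeholders and not published rates for Gemini 3.5 Flash or any other model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Per-million-token prices in USD. Values used below are placeholders."""
    name: str
    input_per_million: float
    output_per_million: float


def monthly_cost(pricing: ModelPricing, requests_per_month: int,
                 input_tokens: int, output_tokens: int) -> float:
    """Estimate monthly spend for a workload with a fixed token profile per request."""
    # Price-weighted tokens consumed by a single request.
    weighted_tokens = (input_tokens * pricing.input_per_million
                       + output_tokens * pricing.output_per_million)
    # Prices are quoted per million tokens, so divide once at the end.
    return requests_per_month * weighted_tokens / 1_000_000


if __name__ == "__main__":
    # Hypothetical models and prices, NOT real figures from any provider.
    fast_model = ModelPricing("hypothetical-fast-model", 1.0, 4.0)
    flagship_model = ModelPricing("hypothetical-flagship-model", 5.0, 20.0)

    # Hypothetical workload: 500,000 support tickets per month,
    # about 2,000 input tokens and 500 output tokens each.
    for model in (fast_model, flagship_model):
        cost = monthly_cost(model, 500_000, 2_000, 500)
        print(f"{model.name}: ${cost:,.2f} per month")
```

Substituting current published prices and a measured token profile turns this into a quick check of whether the savings justify switching models for a given workflow.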

View the full update on zapier.com

Logan Kilpatrick

@OfficialLoganK · May 21

Gemini 3.5 Flash ranks #1 on Automation Bench (from Zapier), beating every other frontier model at a much lower cost https://t.co/UeXp5W7M1h

Still wondering? A few quick answers below.

What is the Zapier Automation Bench?

The Automation Bench is a specialized evaluation framework created by Zapier to measure how effectively AI models handle real-world automation tasks. It specifically tests capabilities in operations and support categories, focusing on a model's ability to use tools, map data, and execute multi-step workflows accurately within an autonomous agentic environment.

How did Gemini 3.5 Flash perform on the Zapier benchmark?

Gemini 3.5 Flash ranked first on the Automation Bench, outperforming all other current frontier models. The results show that the model is particularly effective at handling complex operational and support tasks. It achieved this top ranking while maintaining a significantly lower inference cost compared to the larger flagship models it competed against.

Where can developers access Gemini 3.5 Flash?

Gemini 3.5 Flash is currently available through the Gemini API and Google AI Studio. Developers can use these platforms to integrate the model into their own applications and workflows. The model supports a one-million-token context window, allowing it to process massive amounts of information, such as entire codebases or long documents, in a single request.
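As a rough illustration of what the one-million-token window means in practice, the sketch below estimates whether a batch of documents would fit in a single request. The four-characters-per-token ratio is a common rule of thumb for English text, not a Gemini-specific figure, and the function names and the reserved output budget are hypothetical. A real integration would count tokens with the provider's own tokenizer instead of guessing.

```python
import math

# From the article: Gemini 3.5 Flash supports a one-million-token context window.
CONTEXT_WINDOW_TOKENS = 1_000_000

# Assumption: roughly 4 characters per token, a common heuristic for English text.
# Real token counts depend on the model's tokenizer and on the content.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate for a string using the heuristic above."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_context(documents: list[str], reserved_for_output: int = 8_192) -> bool:
    """Check whether the documents plus an output budget stay within the window.

    reserved_for_output is a hypothetical allowance for the model's reply.
    """
    total = sum(estimate_tokens(doc) for doc in documents)
    return total + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    small_batch = ["ticket text " * 1_000] * 10   # about 120,000 characters
    large_batch = ["log line " * 100_000] * 10    # about 9,000,000 characters
    print(fits_in_context(small_batch))
    print(fits_in_context(large_batch))
```

When a batch does not fit, the usual options are to split it across several requests or to summarize older material before sending it.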

Why is Gemini 3.5 Flash considered a frontier model for automation?

While Flash models are typically designed for speed, Gemini 3.5 Flash is categorized as a frontier model because its intelligence levels match or exceed the most capable models available. Its top ranking on the Zapier benchmark validates that it can handle high-stakes reasoning and tool-use tasks that were previously reserved for much larger and more expensive systems.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Keep reading

Google Gemini 3.5 Flash Beats Larger Models on Agentic Benchmark

Gemini 3.5 Flash has ranked first on the APEX-Agents-AA benchmark, outperforming larger frontier models in autonomous task execution. The result confirms that high-speed, low-cost models are now capable of handling complex agentic workflows previously reserved for larger architectures.

Google Launches Gemini 3.5 Flash with Frontier Performance for Agentic Coding

Google · May 21

Google moved Gemini 3.5 Flash to general availability, positioning it as the strongest agentic and coding model in the Gemini family. The release delivers frontier-level performance at 4x the speed of comparable competitor models, though pricing has risen 3x versus the previous Gemini 3 Flash.

Arena.ai Ranks Google Gemini 3.5 Flash in Top Ten for Coding

Arena · May 19

Gemini 3.5 Flash has entered the Arena.ai leaderboards with a ninth-place ranking in both the overall Text and Frontend Coding categories. The model establishes a new price-performance frontier by delivering a 70-point jump in coding capability over its predecessor.

Warp Adds Gemini 3.5 Flash to Power High Speed Agentic Terminal Workflows

Warp · May 20

Warp integrated Google's Gemini 3.5 Flash into its terminal-based AI agent to optimize autonomous development tasks. The update provides a high-speed, cost-effective alternative for multi-step coding loops where low latency is more critical than raw reasoning depth.
