Google Gemma 4 E4B Drives iOS Simulator for Local On-Device Automation

Google Gemma

May 22, 2026 · Updated Jun 13, 2026

Google demonstrated Gemma 4 E4B autonomously navigating an iOS simulator using the Argent framework. This shift proves that lightweight, open-weight models can handle complex software interactions locally, reducing the need for cloud-based computer use.

Google demonstrated Gemma 4 E4B performing autonomous on-device automation by driving an iOS simulator directly. Using a framework called Argent, the lightweight model navigates mobile software interfaces and handles complex interactions without human direction. This follows the Gemma 4 launch, which brought frontier-level reasoning to consumer hardware.

Most computer use currently relies on massive cloud models due to the high reasoning requirements of UI navigation. By proving a small, edge-optimized model can manage these tasks, Google is lowering the barrier for private, low-latency automation. It extends the Gemma 4 31B autonomous debugging capabilities to mobile environments.

You can now explore local agentic workflows that interact with mobile applications without sending screen data to external servers. This capability is particularly relevant for automated testing and personalized mobile assistants that require high data sovereignty. The Gemma 4 family remains available under an open-weight license for local use.

Google Gemma

@googlegemmaMay 21

We are entering a new era of on-device automation. ✨ Watch Gemma 4 E4B navigate and drive an iOS simulator directly using Argent. Local models can handle complex interactions and software navigation autonomously. https://t.co/xuXqx3flOD

5536.1k

View on X

Still wondering? A few quick answers below.

Gemma 4 E4B is a lightweight, open-weight AI model developed by Google DeepMind. It is part of the Gemma 4 family, which is built on the same architecture as Google's Gemini models. This specific version is optimized for high-performance reasoning on local devices and consumer hardware rather than relying on cloud-based infrastructure.

The model uses a framework called Argent to navigate and drive an iOS simulator directly. This capability, known as computer use, allows the AI to interact with graphical user interfaces by clicking, typing, and navigating applications autonomously. It processes complex software interactions locally on the device to complete multi-step tasks without human intervention.

Yes, Gemma 4 E4B is an open-weight model, meaning its trained parameters are publicly released for developers to download and run on their own hardware. This allows for private, offline agentic workflows where data sovereignty is a priority, as the model does not need to send screen information or data to external servers.

Argent is the automation framework that enables Gemma 4 models to interact directly with software environments like an iOS simulator. It serves as the bridge between the AI model's reasoning and the computer's interface, allowing the model to execute actions and navigate software autonomously as part of an on-device automation loop.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Google →

Keep reading

Google Launches Gemma 4 to Bring Frontier Reasoning to Local Devices

Google released Gemma 4, a new family of open models built on the same architecture as Gemini 3 and licensed under Apache 2.0. These models deliver high-performance reasoning and native multimodal capabilities directly on consumer hardware, enabling private, offline agentic workflows. This shift allows developers to build sophisticated AI applications that run entirely on-device without sacrificing intelligence.

Google GemmaMay 29

Google Launches On-Device Agent Skills for Offline Gemma 4 Workflows

Google released the Google AI Edge Gallery app and LiteRT-LM framework to enable fully offline agentic workflows on mobile and IoT devices. By running Gemma 4 locally, developers can build multi-step agents that plan, use tools, and process multimodal data without cloud latency or privacy risks.

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

OllamaJun 7

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

Ollama has made Google DeepMind's Gemma 4 12B model available for local execution, including support for chat and agentic applications. This expands access to a powerful, open-weight multimodal model optimized for on-device reasoning and coding, enabling private and offline AI workflows on consumer hardware.

Arena Ranks Google Gemma 4 as Top Open Vision Model

ArenaMay 8

Arena Ranks Google Gemma 4 as Top Open Vision Model

Google's Gemma-4-31b and Gemma-4-26b-a4b have entered the Vision Arena leaderboard as the #2 and #4 ranked open models. These releases shift the price-performance frontier by delivering vision reasoning capabilities that rival proprietary systems at a fraction of the cost.

What is Gemma 4 E4B?

How does Gemma 4 E4B automate mobile software?

Is Gemma 4 E4B available for local use?

What is the Argent framework used by Google Gemma?

Keep reading

Google Launches Gemma 4 to Bring Frontier Reasoning to Local Devices

Google Launches Gemma 4 to Bring Frontier Reasoning to Local Devices

Google Launches On-Device Agent Skills for Offline Gemma 4 Workflows

Google Launches On-Device Agent Skills for Offline Gemma 4 Workflows

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

Arena Ranks Google Gemma 4 as Top Open Vision Model

Arena Ranks Google Gemma 4 as Top Open Vision Model

Keep reading

Google Launches Gemma 4 to Bring Frontier Reasoning to Local Devices

Google Launches Gemma 4 to Bring Frontier Reasoning to Local Devices

Google Launches On-Device Agent Skills for Offline Gemma 4 Workflows

Google Launches On-Device Agent Skills for Offline Gemma 4 Workflows

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

Ollama Adds Google DeepMind's Gemma 4 12B for Local Agentic AI

Arena Ranks Google Gemma 4 as Top Open Vision Model

Arena Ranks Google Gemma 4 as Top Open Vision Model