AWS Moves Blackwell GPU Compute to the Edge in Los Angeles

Amazon Web Services (AWS) has made Amazon EC2 G7e instances generally available in its Los Angeles Local Zone. These instances feature NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs and 5th generation Intel Xeon Scalable processors. The expansion brings high-performance GPU compute to the network edge.

The launch follows a broader industry shift toward distributed AI infrastructure, which avoids the latency bottleneck of sending every request to a centralized data center. For agentic AI systems that rely on rapid multi-step reasoning, physical proximity is critical, because network delay is paid again on every step that crosses the network. The same reasoning is behind the wider pattern of moving inference to the edge to support real-time autonomous operations.

You can now deploy large language models (LLMs), which are AI models trained to understand and generate text, as well as post-production pipelines in the us-west-2-lax-1b zone. The setup supports real-time 3D rendering and VFX composition with low-latency local storage. To start, opt in to the zone via the AWS Global View console; the instances can be purchased On-Demand or through Savings Plans.
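
The opt-in step can also be scripted rather than done in the console. The following is a minimal sketch, not an official procedure: it assumes the boto3 SDK, an EC2 client created for the parent Region (assumed here to be us-west-2), and credentials that allow the two EC2 calls shown. The helper name `opt_in_to_local_zone` is hypothetical. The zone group name is read from the API response rather than hard-coded.

```python
def opt_in_to_local_zone(ec2, zone_name="us-west-2-lax-1b"):
    """Opt in to the zone group containing `zone_name` and return the group name.

    `ec2` is assumed to be a boto3 EC2 client for the parent Region, e.g.
    boto3.client("ec2", region_name="us-west-2").
    """
    # AllAvailabilityZones=True makes the call return zones that are not yet
    # enabled for the account, which is the case before opting in.
    resp = ec2.describe_availability_zones(
        AllAvailabilityZones=True,
        Filters=[{"Name": "zone-name", "Values": [zone_name]}],
    )
    zones = resp["AvailabilityZones"]
    if not zones:
        raise ValueError(f"zone {zone_name!r} is not visible from this Region")

    zone = zones[0]
    # Opt-in is applied to the whole zone group, not to a single zone.
    if zone["OptInStatus"] != "opted-in":
        ec2.modify_availability_zone_group(
            GroupName=zone["GroupName"],
            OptInStatus="opted-in",
        )
    return zone["GroupName"]
```

Calling the helper a second time is harmless, since it skips the modify call once the zone reports `opted-in`.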

Frequently asked questions

What are Amazon EC2 G7e instances?
Amazon EC2 G7e instances are high-performance virtual machines designed for graphics-intensive and AI workloads. They feature NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs and 5th generation Intel Xeon Scalable processors. These instances are optimized for tasks requiring a balance of powerful GPU compute and low-latency access, such as real-time rendering and deploying large language models.
Where are G7e instances available?
As of April 2026, Amazon EC2 G7e instances are generally available in AWS Local Zones in Los Angeles, California. Specifically, users can access them in the us-west-2-lax-1b zone. This geographic expansion allows businesses in the Southern California region to run demanding GPU workloads physically closer to their end users to minimize network latency.
What hardware powers the AWS G7e instances?
These instances are accelerated by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs, which provide advanced graphics and AI inference capabilities. They also utilize 5th generation Intel Xeon Scalable processors, also known as Emerald Rapids. This combination of hardware supports enhanced real-time rendering, 2D and 3D visual effects composition, and the execution of complex agentic AI loops.
How do I access G7e instances in the Los Angeles Local Zone?
To use G7e instances in Los Angeles, you must first opt in to the us-west-2-lax-1b Local Zone through the AWS Global View console. Once the zone is enabled, you can launch and manage the instances using the Amazon EC2 console, the AWS Command Line Interface, or the AWS SDKs. They are available at standard On-Demand pricing or through Savings Plans.
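
As an illustration of the SDK route, the sketch below lists which G7e sizes are offered in the zone before you try to launch one. It is a hedged example, not AWS's documented workflow: it assumes boto3, an EC2 client for the parent Region (assumed to be us-west-2), and that G7e instance type names start with `g7e.`. The function name `g7e_sizes_in_zone` is hypothetical.

```python
def g7e_sizes_in_zone(ec2, zone_name="us-west-2-lax-1b"):
    """Return the sorted G7e instance type names offered in `zone_name`.

    `ec2` is assumed to be a boto3 EC2 client, e.g.
    boto3.client("ec2", region_name="us-west-2").
    """
    sizes = []
    token = None
    while True:
        kwargs = {
            "LocationType": "availability-zone",
            "Filters": [
                {"Name": "location", "Values": [zone_name]},
                # Wildcard match on the family prefix (an assumption).
                {"Name": "instance-type", "Values": ["g7e.*"]},
            ],
        }
        if token:
            kwargs["NextToken"] = token
        resp = ec2.describe_instance_type_offerings(**kwargs)
        sizes.extend(o["InstanceType"] for o in resp["InstanceTypeOfferings"])
        # Keep paging until the service stops returning a continuation token.
        token = resp.get("NextToken")
        if not token:
            break
    return sorted(sizes)
```

An empty list suggests either that the zone is not yet enabled for the account or that the family is not offered there; the function does not distinguish between the two.
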
What are the primary use cases for G7e instances at the edge?
G7e instances are built for creative and AI workloads that require low latency. Creative uses include visual effects editorial, color correction, and post-production finishing. For AI, they are used to deploy large language models and agentic AI systems at the edge, where immediate response times are necessary for interactive applications and real-time data processing.