Perplexity Computer announces hybrid inference to balance local privacy and cloud power

Perplexity

Jun 2, 2026 · Updated Jun 12, 2026

Perplexity is launching a hybrid agentic inference system that automatically routes tasks between on-device models and cloud-based frontier models. The update allows sensitive data to remain local while utilizing server-side compute for complex reasoning.

Perplexity is announcing hybrid agentic inference for its Personal Computer platform. This orchestration layer automatically splits workloads between local and cloud environments. The system uses a compact local model to identify sensitive data—like financial records—and processes it on-device, while routing complex reasoning to frontier models in the cloud.

Availability: July 2026
Local Hardware Support: Intel and NVIDIA RTX Spark
Orchestration Logic: Automatic task-by-task routing
Primary Benefit: Local privacy for sensitive data
Cloud Integration: Frontier models in the cloud

This shift addresses efficiency by reserving expensive server-side inference for work that genuinely requires it. By orchestrating compute location, Perplexity reduces dependency on centralized infrastructure. It positions user hardware as a private data center, mirroring on-device agent efforts like Google's Gemma 4 to balance privacy with frontier-level performance.

Coming in July 2026, the hybrid system will run across local silicon, including Intel chips and NVIDIA RTX Spark hardware. Users will not need to manually toggle modes; the orchestrator handles routing automatically based on task complexity and data sensitivity. This model-agnostic harness ensures capable models are used only when necessary.

View the full update on perplexity.ai

Perplexity

@perplexity_aiJun 2

Today we're announcing that hybrid agentic inference is coming to Perplexity Computer. Computer can split tasks between a local model running on your machine and frontier models in the cloud. This keeps private data on your device and maximizes token efficiency. Coming soon. https://t.co/6t3PrmI1FX

2012.2k

View on X

Still wondering? A few quick answers below.

It is a system that splits AI tasks between your local device and cloud servers. It uses local models for privacy-sensitive data and cloud-based frontier models for complex reasoning, optimizing for both security and performance without requiring manual user intervention.

The system is built on a model-agnostic framework designed to run across various local silicon. At launch, it supports Intel processors and NVIDIA RTX Spark hardware, allowing the orchestrator to utilize the specific AI acceleration capabilities of the user's machine.

Perplexity plans to release the hybrid local-server inference orchestrator in July 2026. It will be integrated into the existing Personal Computer platform, which currently allows users to orchestrate tasks across local files, native applications, and web browsers.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Perplexity →

Keep reading

Perplexity Launches Personal Computer for Local File and App Orchestration

Perplexity released Personal Computer, a feature for its Mac app that allows the AI to orchestrate tasks across local files, native applications, and web browsers. This update transforms the answer engine into a proactive digital worker capable of executing multi-step workflows directly on a user's hardware.

Vercel Deployments Now Managed by Perplexity Computer in Natural Language

VercelJun 5

Vercel Deployments Now Managed by Perplexity Computer in Natural Language

Vercel is now available within Perplexity Computer, allowing users to connect their Vercel account and manage deployments. This integration enables developers to inspect projects, diagnose build failures, and trigger redeploys using natural language commands, extending AI agent capabilities into cloud development workflows.

What is hybrid agentic inference in Perplexity Computer?

Which hardware supports Perplexity's local inference?

When will hybrid inference be available for Perplexity users?

Keep reading

Perplexity Launches Personal Computer for Local File and App Orchestration

Perplexity Launches Personal Computer for Local File and App Orchestration

Vercel Deployments Now Managed by Perplexity Computer in Natural Language

Vercel Deployments Now Managed by Perplexity Computer in Natural Language

Keep reading

Perplexity Launches Personal Computer for Local File and App Orchestration

Perplexity Launches Personal Computer for Local File and App Orchestration

Vercel Deployments Now Managed by Perplexity Computer in Natural Language

Vercel Deployments Now Managed by Perplexity Computer in Natural Language