Intel Arc Pro B70 Delivers Up to 1.8x Inference Performance Boost for Large AI Models

Intel News


Intel's latest MLPerf Inference v6.0 benchmarks show the Arc Pro B70 GPU achieving up to 1.8x the inference performance of the previous generation. Paired with Intel Xeon 6 CPUs, the hardware can run 120B-parameter models on workstation-class systems using an open software stack.

Intel released MLPerf Inference v6.0 results for its Intel Xeon 6 CPUs and Intel Arc Pro B-Series GPUs. The Intel Arc Pro B70 demonstrated up to 1.8x higher inference performance than the previous generation. A four-GPU setup provides 128GB of VRAM in total, enough to run 120B-parameter models with high concurrency.
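To see why 128GB matters for a 120B-parameter model, the arithmetic can be sketched as below. This is a rough, weights-only estimate and not Intel's methodology: the precisions shown (16, 8, and 4 bits per parameter) are assumptions chosen for illustration, the function names are hypothetical, and real deployments also need memory for the KV cache, activations, and runtime overhead.

```python
# Back-of-the-envelope estimate of weight memory versus total VRAM.
# Assumptions (not from the article): the bit widths below, and that
# 1 GB = 1e9 bytes. Only model weights are counted.

TOTAL_VRAM_GB = 128.0   # four-GPU total quoted in the article
PARAMS_BILLION = 120.0  # model size quoted in the article


def weight_memory_gb(params_billion: float, bits_per_param: int) -> float:
    """Approximate memory needed for the weights alone, in GB."""
    total_bytes = params_billion * 1e9 * bits_per_param / 8
    return total_bytes / 1e9


def headroom_gb(bits_per_param: int) -> float:
    """VRAM left for KV cache and everything else (negative means no fit)."""
    return TOTAL_VRAM_GB - weight_memory_gb(PARAMS_BILLION, bits_per_param)


if __name__ == "__main__":
    for bits in (16, 8, 4):
        weights = weight_memory_gb(PARAMS_BILLION, bits)
        print(f"{bits:>2}-bit weights: {weights:6.1f} GB, "
              f"headroom: {headroom_gb(bits):7.1f} GB")
```

Under these assumptions, 16-bit weights alone exceed 128GB, while lower-precision weights fit and leave the remainder for the KV cache, which is what determines how many concurrent requests and how much context the system can hold.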

This update positions Intel as a viable alternative for developers seeking high-performance AI inference without proprietary lock-in. With a claimed 1.6x more KV cache capacity than comparable competing hardware, the systems can handle larger models and longer context windows. The open, containerized Linux stack aims to simplify enterprise-grade deployments.

These systems can be used for local LLM inference and fine-tuning. The Intel Xeon 6 processors include built-in acceleration such as AMX (Advanced Matrix Extensions) and AVX-512, allowing some AI tasks to run efficiently without dedicated accelerators. The systems are designed for workstations where data privacy and cost-efficiency are priorities.
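On Linux, one way to check whether a CPU advertises these extensions is to look at the `flags` lines in `/proc/cpuinfo`. The sketch below assumes the kernel's usual flag names (`amx_tile`, `amx_bf16`, `amx_int8`, `avx512f`); the helper `detect_flags` is a hypothetical name introduced here, and the presence of a flag does not by itself guarantee that a given AI framework will use it.

```python
from pathlib import Path

# Flag names as they commonly appear in /proc/cpuinfo on Linux (assumed set).
FLAGS_OF_INTEREST = ("amx_tile", "amx_bf16", "amx_int8", "avx512f")


def detect_flags(cpuinfo_text: str) -> dict[str, bool]:
    """Report which flags of interest appear in any 'flags' line."""
    present: set[str] = set()
    for line in cpuinfo_text.splitlines():
        if line.startswith("flags"):
            # Everything after the first colon is a space-separated flag list.
            _, _, value = line.partition(":")
            present.update(value.split())
    return {flag: flag in present for flag in FLAGS_OF_INTEREST}


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        for flag, found in detect_flags(cpuinfo.read_text()).items():
            print(f"{flag}: {'yes' if found else 'no'}")
    else:
        print("/proc/cpuinfo not found (not a Linux system?)")
```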

Intel News (@intelnews) on X:

Demand for AI inference = demand for greater performance. See how today’s newly released MLPerf Inference v6.0 benchmarks show how #IntelXeon 6 CPUs and #IntelArcPro B-series GPUs deliver—with Intel Arc Pro B70 providing up to 1.8x higher inference performance over previous generations. Read more: https://t.co/94aZZbXsGf


