Zhipu AI

Zhipu AI AI News & Updates

The latest AI news and updates of Zhipu AI — AI lab behind the GLM model series and the Z.ai developer platform. Covering Zhipu AI's latest product updates and launches from the past 90 days.

Zhipu AIZhipu AIMay 21

Z.ai Deploys ZCube Network to Slash Inference Costs and Latency

Z.ai successfully deployed its ZCube network architecture in production to power GLM-5.1 coding services, reducing hardware costs by 33% while boosting throughput. By flattening the network topology, the system eliminates the congestion typically caused by moving massive amounts of data between GPUs during long-context inference.

Read more
Zhipu AIZhipu AIMay 8

Zhipu AI Details GLM-5V-Turbo Architecture for Native Multimodal Agents

Zhipu AI released a technical report for GLM-5V-Turbo, detailing the architecture and training methods behind its multimodal agent capabilities. The report highlights how native vision integration and scaled reinforcement learning enable the model to perceive and act across complex GUIs and coding tasks.

Zhipu AIZhipu AIApr 30

Z.ai Resolves GLM-5 Infrastructure Race Conditions to Stabilize Long Horizon Coding Agents

Z.ai identified and fixed low-level race conditions in its GLM-5 inference stack that caused garbled outputs and repetition during high-concurrency coding tasks. By introducing a layer-wise cache storage scheme called LayerSplit, the lab also increased system throughput by up to 132% for long-context workloads.

Zhipu AIZhipu AIApr 27

Z.ai Extends Triple Usage Quotas for GLM-5 Series Through June

Z.ai extended its high-volume usage promotion for the GLM-5.1 and GLM-5-Turbo models until June 30, 2026. This extension allows developers to run intensive agentic engineering tasks with three times the standard capacity during non-peak hours.

Zhipu AIZhipu AIApr 17

Zhipu AI releases GLM-5.1 template fix to stop infinite tool calling loops

Zhipu AI released an updated chat template for the GLM-5.1 model series to fix a critical bug in tool calling. The previous template failed to parse array-formatted tool outputs from frameworks like vLLM, causing models to ignore results and repeat calls indefinitely.

Zhipu AIZhipu AIApr 7

Zhipu AI Launches GLM-5.1 to Enable Eight Hour Autonomous Engineering Sessions

Zhipu AI released GLM-5.1, an open-source model that ranks first among open-weights systems on major coding benchmarks like SWE-Bench Pro. The model is designed for long-horizon tasks, capable of running autonomously for eight hours to refine strategies and execute thousands of tool calls.

Zhipu AIZhipu AIApr 1

Zhipu AI launches GLM-5V-Turbo to bridge visual design and autonomous coding

Zhipu AI released GLM-5V-Turbo, a multimodal foundation model specifically architected for vision-based coding and GUI agent workflows. It natively understands images, videos, and design drafts to automate frontend recreation and visual debugging without degrading text-based reasoning.

Zhipu AIZhipu AIMar 31

Z.ai Launches AutoClaw for One Click Local AI Agent Deployment

Z.ai released AutoClaw, a local setup tool for the OpenClaw assistant framework that requires no API keys and keeps all data on-device. By pairing the tool with the new GLM-5-Turbo model, users can run complex multi-step agentic workflows entirely within their own infrastructure.

Frequently asked questions

Zhipu AI is AI lab behind the GLM model series and the Z.ai developer platform. HeadsUpAI tracks Zhipu AI across the AI ecosystem and curates every significant update — the latest being "Z.ai Deploys ZCube Network to Slash Inference Costs and Latency" (May 21, 2026) — so you get the whole story in a 30-second read.

The most recent Zhipu AI update is "Z.ai Deploys ZCube Network to Slash Inference Costs and Latency" (May 21, 2026). HeadsUpAI curates every significant Zhipu AI release as a 30-second read — what shipped and why it matters.

The latest Zhipu AI updates: "Z.ai Deploys ZCube Network to Slash Inference Costs and Latency", "Zhipu AI Details GLM-5V-Turbo Architecture for Native Multimodal Agents", "Z.ai Resolves GLM-5 Infrastructure Race Conditions to Stabilize Long Horizon Coding Agents", "Z.ai Extends Triple Usage Quotas for GLM-5 Series Through June", and "Zhipu AI releases GLM-5.1 template fix to stop infinite tool calling loops". HeadsUpAI has curated 8 Zhipu AI updates over the last 90 days, covering product updates and launches — listed newest first, presented straight, no hype, no bias.

Zhipu AI is AI lab behind the GLM model series and the Z.ai developer platform. On this page you'll find every significant Zhipu AI development HeadsUpAI has tracked recently — product updates and launches — so you can keep up with where Zhipu AI is heading without reading a dozen sources.

Continuously. HeadsUpAI adds new Zhipu AI updates as they're announced — usually within hours — and the 8 updates currently shown cover the past 90 days, newest first.