HeadsUpAI

Z.ai Extends Triple Usage Quotas for GLM-5 Series Through June

Z.ai, the AI lab behind the GLM model series, extended its triple usage promotion for GLM-5.1 and GLM-5-Turbo through June 30. This incentive triples the available capacity for users on the GLM Coding Plan during all hours except the 2–6 AM ET peak window.
Promotion end date
June 30, 2026
Models included
GLM-5.1, GLM-5-Turbo
Usage multiplier
3x standard capacity
Peak window (standard rates)
2–6 AM ET
Required plan
GLM Coding Plan

The extension supports the high token demands of Zhipu AI's GLM-5.1 flagship, which is designed for long-horizon tasks requiring autonomous work sessions. By maintaining these limits, the lab lowers the cost barrier for complex agentic workflows. This move mirrors a broader trend of OpenRouter's extended model access to incentivize testing of frontier models.

You can utilize the increased limits by selecting the GLM-5 series in supported coding agents like Claude Code, Cline, or Cursor. The promotion applies automatically to the GLM Coding Plan, providing the same high-volume capacity previously reserved for the older GLM-4.7 model. Standard quotas still apply during the 2–6 AM ET peak window.

Z.ai
Z.ai
@Zai_org
X

The "triple usage" period for GLM-5.1 and GLM-5-Turbo is now extended to June 30. Availability: Anytime except 2-6 AM ET.

44retweets808likes
View on X

Still wondering? A few quick answers below.

The triple usage period for the GLM-5.1 and GLM-5-Turbo models has been extended through June 30, 2026. This promotion was originally scheduled to conclude on April 30, but users now have an additional two months of high-volume capacity to test and deploy these models for complex agentic engineering tasks.

This extension specifically applies to the GLM-5.1 and GLM-5-Turbo models. GLM-5.1 is the flagship model designed for long-horizon tasks that require autonomous execution for several hours, while GLM-5-Turbo is a faster variant optimized for agent workflows. Both models are part of the latest GLM-5 series from the Z.ai lab.

The tripled usage limits are available at all times except during the peak window of 2 AM to 6 AM ET. During these four hours, standard usage rates apply. Outside of this specific early morning window, users on the GLM Coding Plan can access three times their normal capacity for the GLM-5 series.

Eligibility for the tripled usage limits is restricted to users subscribed to the GLM Coding Plan. This plan is designed for developers using coding agents like Claude Code, Cline, or Cursor. To use the promotion, subscribers must manually configure their coding tools to use the GLM-5.1 or GLM-5-Turbo model identifiers.

This promotion provides the GLM-5 series with the same high-volume capacity previously available for the older GLM-4.7 model during non-peak hours. By tripling the standard limits, Z.ai allows developers to perform more intensive debugging and refactoring sessions without hitting the rate limits typically associated with newer, more capable frontier models.

Share this update