Introducing GLM-5.1: The Next Level of Open Source
- Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo.
- Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations.
https://t.co/YQZLhKVwik
Zhipu AI Launches GLM-5.1 to Enable Eight-Hour Autonomous Engineering Sessions
Zhipu AI · Updated
Zhipu AI released GLM-5.1, an open-source model that ranks first among open-weights systems on major coding benchmarks like SWE-Bench Pro. The model is designed for long-horizon tasks, capable of running autonomously for eight hours to refine strategies and execute thousands of tool calls.
Zhipu AI positions GLM-5.1 as a flagship open-source model for agentic coding (AI that autonomously writes and iterates on code). It scores 58.4 on SWE-Bench Pro, a benchmark (standardized test for measuring model capabilities), which places it first among open-source models and third globally.
The update shifts the focus from simple code generation to long-horizon autonomy. While most models struggle with multi-step reasoning over time, this model is built to run for eight hours straight. It uses a self-review loop to refine features and interactions, narrowing the gap between open-weights models and top-tier proprietary systems.
You can use the model for complex technical workflows like database optimization, where it reportedly delivered a 6x performance gain compared with standard 50-turn sessions. GLM-5.1 is available now through OpenRouter, Vercel AI Gateway, and Requesty, with model weights hosted on Hugging Face for local deployment.