Moonshot AI Launches Kimi K2.6 with 4000 Step Long Horizon Coding

Kimi

Apr 20, 2026 · Updated May 9, 2026

Moonshot AI released Kimi K2.6, an open-source model that achieves state-of-the-art scores on SWE-Bench Pro and Toolathlon. The update introduces long-horizon coding, enabling agents to execute over 4,000 autonomous steps without losing context or drifting from the task.

Moonshot AI, an AI lab focused on long-context and agentic systems, released Kimi K2.6, a Mixture-of-Experts (MoE) model (an architecture that uses specialized sub-networks for efficiency). This version claims open-source state-of-the-art performance on SWE-Bench Pro (58.6), a result matching its top rank on OpenRouter for programming.

SWE-Bench Pro: 58.6
SWE-bench Multilingual: 76.7
Toolathlon: 50.0
Math Vision with Python: 93.2
Coding Horizon: 4,000+ steps
HLE with tools: 54.0

The update builds on the Kimi K2.5 foundation that previously powered Cursor Composer 2. By extending the coding horizon to over 4,000 steps, Moonshot AI addresses the primary bottleneck in agentic workflows: model failure during long migrations. This capability extends to a new agent swarm that coordinates hundreds of sub-agents.

You can integrate Kimi K2.6 into agentic workflows via the Kimi API, where it supports specialized tasks like math vision and multilingual coding. The model follows the release of FlashKDA to double prefill speeds, ensuring these intensive sessions maintain high throughput (data volume processed over time) without the latency typically associated with massive context.

View the full update on kimi-k2.org

Kimi.ai

@Kimi_MoonshotApr 20

Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ https://t.co/wkzsQqKphv

1.2k10k

View on X

Still wondering? A few quick answers below.

Kimi K2.6 is a large-scale Mixture-of-Experts language model developed by Moonshot AI, which uses specialized sub-networks to increase efficiency. It is specifically optimized for agentic coding and complex reasoning tasks. The model features a trillion-parameter architecture designed to handle multi-step autonomous workflows and long-context interactions, positioning it as a leading open-weight alternative for developers.

Kimi K2.6 achieved state-of-the-art results across several specialized benchmarks. It scored 58.6 on SWE-Bench Pro for software engineering and 76.7 on SWE-bench Multilingual. Additionally, it reached 54.0 on Humanity's Last Exam with tools and 93.2 on Math Vision using Python, demonstrating high-level proficiency in both coding and technical reasoning compared to other open-weight models.

Long-horizon coding refers to the model's ability to maintain coherence and accuracy over extended autonomous tasks. Kimi K2.6 supports sequences of over 4,000 steps in a single coding session. This capability allows AI agents to complete complex, multi-file software migrations and debugging cycles without losing track of the original goal or drifting from the intended logic.

Kimi K2.6 is released as an open-weight model, making its parameters available for developers to use and build upon. While it is developed by Moonshot AI, its performance on benchmarks like SWE-Bench Pro establishes it as a top-tier open-source option for agentic coding, offering a transparent alternative to proprietary models like those from OpenAI or Anthropic.

Kimi K2.6 is available through the Kimi API provided by Moonshot AI. The model entered beta testing in April 2026, allowing developers to integrate its long-horizon coding and tool-use capabilities into their own applications. It is also designed to work with specialized infrastructure like cross-datacenter inference to manage the high compute demands of long-context requests.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Kimi →

Keep reading

Moonshot AI Kimi K2.6 Claims Top Spot on OpenRouter Programming Leaderboard

Kimi K2.6 is now the highest-ranked model for programming on OpenRouter, outperforming established frontier models in coding tasks. This third-party validation confirms the model's effectiveness for complex software development workflows just days after its release.

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AIApr 25

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI added Moonshot AI's Kimi K2.6 to its training platform, enabling supervised fine-tuning and reinforcement learning on the 1-trillion parameter model. This allows teams to customize the leading open-weight agentic model for specific production workflows while maintaining a 265K context window.

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

ArenaApr 24

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

Arena.ai confirmed that Moonshot AI's Kimi K2.6 is now the highest-ranked open model in both the Vision and Document Arenas. The model broke into the top 10 overall for document analysis, matching the performance of proprietary frontier models like Gemini 3.1 Pro.

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

CloudflareMar 20

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare's Workers AI now supports Kimi K2.5, Moonshot AI's frontier open-source model with a 256k context window. Developers can build and run full agent workflows on Cloudflare's platform, with prefix caching and a new async API cutting inference costs.

What is Kimi K2.6?

What are the benchmark scores for Kimi K2.6?

What is long-horizon coding in Kimi K2.6?

Is Kimi K2.6 open source?

How can I access Kimi K2.6?

Keep reading

Moonshot AI Kimi K2.6 Claims Top Spot on OpenRouter Programming Leaderboard

Moonshot AI Kimi K2.6 Claims Top Spot on OpenRouter Programming Leaderboard

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Keep reading

Moonshot AI Kimi K2.6 Claims Top Spot on OpenRouter Programming Leaderboard

Moonshot AI Kimi K2.6 Claims Top Spot on OpenRouter Programming Leaderboard

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Fireworks AI Adds Kimi K2.6 Training to Build Custom Frontier Agents

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

Arena.ai Ranks Kimi K2.6 as Top Open Model for Vision and Documents

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows

Cloudflare Workers AI Adds Kimi K2.5 for End-to-End Agent Workflows