Xiaomi Launches MiMo-V2.5 Series With 1M Context and Reasoning Tokens

OpenRouterOpenRouter

· Updated

Xiaomi released the MiMo-V2.5 series on OpenRouter, featuring a 1 million token context window and native multimodal support for image and video tasks. The models are specifically architected for long-horizon agentic workflows and coding, offering reasoning-enabled thinking tokens to improve task stability. By delivering pro-level performance at roughly half the typical inference cost, these models lower the economic barrier for deploying autonomous agents at scale.

Xiaomi, a consumer electronics and software company, launched the MiMo-V2.5 series on OpenRouter with native omnimodal capabilities for text, images, and video. Both variants feature a 1M token context window and support reasoning tokens, allowing the model to perform internal deliberation before generating a final response.
Context window
1,048,576 tokens
Max output tokens
131,072 tokens
Pricing (input)
$0.40 per million tokens
Pricing (output)
$2.00 per million tokens
Modality
Native omnimodal (text, image, video)
Availability
OpenRouter API

This release extends the availability of Xiaomi's agent-centric models as developers shift toward long-running autonomous workflows. By optimizing for agentic performance and coding stability, the series addresses reliability issues in multi-step tasks. It follows a trend of optimizing models specifically for agent pipelines.

You can integrate mimo-v2.5 into agent frameworks to handle massive codebases in a single pass. The model is available via the OpenRouter API at $0.40 per million input tokens and $2 per million output tokens. To use reasoning, enable the reasoning parameter and preserve the reasoning_details array.

OpenRouter
OpenRouter
@OpenRouter
X

MiMo-V2.5 series from @XiaomiMiMo is live now on OpenRouter! Both V2.5 and V2.5-Pro see improvements over V2-Pro and V2-Omni, with a focus on long running agent tasks and coding ability. They also both launch with 1 million context, and are extremely token efficient. https://t.co/FQUluEFLb7

32retweets472likes
View on X

Still wondering? A few quick answers below.

MiMo-V2.5 is priced at $0.40 per million input tokens and $2.00 per million output tokens on the OpenRouter platform. This pricing structure is designed to be highly token-efficient, offering pro-level performance for agentic tasks at approximately half the inference cost of comparable high-end models in the current market.

Both MiMo-V2.5 and MiMo-V2.5-Pro feature a massive context window of 1,048,576 tokens. This 1M token capacity allows the models to process entire codebases, lengthy documents, and extended conversation histories in a single pass, which is particularly useful for complex agentic workflows that require maintaining a large amount of state.

MiMo-V2.5 supports reasoning tokens, which are internal thinking tokens the model generates to work through logic before providing a final response. Users can enable this by using the reasoning parameter in their API request. Accessing the reasoning details array allows developers to see the step-by-step internal thinking process used by the model.

MiMo-V2.5 is a native omnimodal model, meaning it was built to understand multiple types of data within a single architecture. It specifically shows improvements in multimodal perception for image and video understanding tasks, surpassing previous versions like MiMo-V2-Omni. This makes it suitable for agents that need to interpret visual information alongside text.

The MiMo-V2.5 series is specifically optimized for long-running agent tasks and advanced coding. Its combination of a 1M context window, reasoning capabilities, and high token efficiency makes it an ideal engine for autonomous agent frameworks. It excels at tasks requiring complex instruction decomposition, stable tool use, and deep reasoning over large datasets.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Share this update