HeadsUpAI

OpenRouter adds MiniMax-M3 with 1M context for multimodal agentic coding

OpenRouter added MiniMax-M3, a multimodal foundation model (a base AI system trained on broad data) built for long-horizon agentic tasks. It supports a 1-million-token context window and native video inputs. Using MiniMax Sparse Attention, it replaces full attention with KV-block selection, cutting long-context compute costs by 95%.
Context Window
1,048,576 tokens
Max Output
512,000 tokens
Input Modalities
Text, Image, Video
Architecture
MiniMax Sparse Attention (MSA)
Pricing (Promo)
$0.30/M input, $1.20/M output

This release extends the agentic trajectory of MiniMax M2.7 into multimodal territory. It follows the pattern of DeepSeek-V4 in making 1M-token context the standard for autonomous agents. Native multimodality enables reasoning across interleaved text, image, and video data during complex, multi-step workflows.

Access minimax/minimax-m3 via OpenRouter at a 50% discount through June 7, 2026, priced at $0.30 per million input tokens. The model supports a reasoning parameter to expose internal thinking tokens and is optimized for multi-turn collaboration. Open weights are available on Hugging Face.

OpenRouter
OpenRouter
@OpenRouter
X

MiniMax-M3 is live on OpenRouter! A frontier-class open-weight model that combines a 1M-token context window, frontier coding and agentic performance, and native multimodality (image & video) in one model. https://t.co/ocxd2OSYkk

21retweets431likes
View on X

Still wondering? A few quick answers below.

MiniMax-M3 is a multimodal foundation model designed for long-horizon agentic tasks and coding. It features a 1-million-token context window and native support for text, image, and video inputs, allowing it to reason across massive datasets in a single inference loop.

MiniMax Sparse Attention (MSA) replaces traditional full attention mechanisms with KV-block selection. This architectural shift reduces the computational resources required for long-context processing, cutting per-token compute costs by approximately 95% compared to previous model generations while maintaining output quality.

The model is available via the OpenRouter API as minimax/minimax-m3, featuring a 50% discount during its launch week. Additionally, MiniMax has released the model weights on Hugging Face, allowing developers to download and run the model on their own infrastructure.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Share this update