The first public foundation models from @poolsideai just dropped on OpenRouter! Laguna M.1 and Laguna XS.2. Built from scratch for agentic coding and long-horizon work. Free for a limited time ⬇️ https://t.co/toDCzLsbgu
OpenRouter Hosts Poolside AI Laguna Models for Free Agentic Coding
· Updated
OpenRouter added the first public models from Poolside AI, a frontier lab building foundation models for software agents, to its unified API. The release includes Laguna M.1, a flagship model for complex engineering, and Laguna XS.2, a 33B Mixture of Experts (an architecture that activates only a subset of parameters per token) model.
- Context window
- 128K tokens
- Output limit
- 8K tokens
- Architecture (XS.2)
- 33B Mixture of Experts
- Active parameters (XS.2)
- 3B tokens
- Precision
- fp8 quantized
- Pricing
- Free for a limited time
While most developers use general-purpose models for coding, these models are built from scratch for the agentic loop. They prioritize tool calling and reasoning over chat fluency to handle multi-step tasks. This release follows a surge in specialized reasoning models and long-context agentic tools appearing on the platform.
Access both models via the OpenRouter API or download the weights from Hugging Face. Both feature a 128K context window and are quantized to fp8 (a 8-bit floating point format for faster inference) to reduce latency. Access on OpenRouter is currently free for a limited time.
OpenRouter
@OpenRouter
20retweets249likes
View on XStill wondering? A few quick answers below.
Laguna M.1 and Laguna XS.2 are the first public foundation models from Poolside AI, a lab focused on software engineering agents. Unlike general-purpose models, these are built from scratch for agentic coding and long-horizon tasks. They prioritize tool calling and reasoning capabilities to help autonomous agents navigate complex codebases and execute multi-step software development workflows.
Laguna XS.2 uses a 33B parameter Mixture of Experts architecture, where only 3B parameters are active during any single inference turn. This design allows the model to maintain high reasoning performance for coding tasks while remaining computationally efficient. It is quantized to fp8 precision, which further optimizes it for fast and cost-effective execution in agentic loops.
While not strictly open source in terms of training data, Poolside has released the model weights for both Laguna M.1 and Laguna XS.2 on Hugging Face. This open-weight release allows developers to download and run the models on their own infrastructure. Additionally, the models are available for public use via the OpenRouter API for integrated workflows.
Both Laguna models feature a 128,000 token context window, allowing them to process large segments of code or multiple files in a single request. They also support an output limit of up to 8,000 tokens. This large capacity is specifically designed for long-horizon work where an agent needs to maintain state across complex engineering tasks.
You can access Laguna M.1 and Laguna XS.2 through the OpenRouter API, where they are currently available for free for a limited time. Developers can also find the model weights on Hugging Face for local deployment. The models are optimized for integration into coding agents, IDEs, and CI/CD workflows that require specialized reasoning for software engineering.



