OpenRouter launches GPT-5.4 Image 2 to unify frontier reasoning and visual generation

OpenRouter

Apr 24, 2026 · Updated May 2, 2026

OpenRouter added OpenAI's gpt-5.4-image-2 to its unified API, combining the latest reasoning model with high-fidelity image generation. This update allows developers to build workflows where the model refines its own visual prompts to produce more accurate, instruction-aligned assets.

OpenRouter, a platform providing a unified API for accessing hundreds of AI models, launched openai/gpt-5.4-image-2. This multimodal model integrates the advanced reasoning of OpenAI's latest frontier model with the high-fidelity output of the GPT Image 2 engine to enable complex visual workflows.

Context window: 272,000 tokens
Max output: 128,000 tokens
Pricing (input): $8 per million tokens
Pricing (output): $15 per million tokens
Pricing (image output): $30 per million tokens
Base model: GPT-5.4
Image engine: GPT Image 2

This release mirrors the industry shift toward functional precision seen in ChatGPT Images 2.0. By using the OpenAI Responses API, the model acts as an orchestrator that calls an internal image generation tool. This allows GPT-5.4 to technically refine user prompts, ensuring higher adherence to complex instructions.

Access the model via the OpenRouter API by specifying image and text modalities in your requests. Pricing is $8 per million input tokens and $15 per million output tokens, with generated images costing $30 per million tokens. The model features a 272,000-token context window.

View the full update on openrouter.ai

OpenRouter

@OpenRouterApr 22

The incredibly powerful new GPT Image 2 model from @OpenAI is live on OpenRouter as openai/gpt-5.4-image-2 Using OpenRouter's GPT-5.4 Image 2, you get the image gen capabilities of Image 2 alongside the prompt improvement of GPT-5.4. https://t.co/RelMC3zKwQ

11193

View on X

Still wondering? A few quick answers below.

GPT-5.4 Image 2 is a multimodal model from OpenAI that combines the reasoning capabilities of GPT-5.4 with the high-quality visual generation of GPT Image 2. It allows users to perform complex tasks involving text, code, and images within a single interaction, using the language model to refine prompts for better visual accuracy.

The model operates by calling the OpenAI Responses API, where the GPT-5.4 model acts as an orchestrator with access to an Image Generation server tool. When users specify both image and text modalities in their request, the model can generate images from text prompts and return them as base64-encoded data URLs.

OpenRouter charges $8 per million input tokens and $15 per million output tokens for text processing. For visual tasks, image inputs are processed as part of the prompt, while image outputs are priced at $30 per million tokens. These rates allow developers to access OpenAI's frontier multimodal capabilities through a unified API.

GPT-5.4 Image 2 features a large 272,000-token context window, which allows it to process extensive documents or complex multi-image prompts in a single session. The model supports a maximum output of 128,000 tokens, providing significant headroom for generating long-form text alongside high-resolution visual assets in multimodal workflows.

The model is currently live and available to all developers using the OpenRouter platform. Users can access it through the unified API or test it directly in the OpenRouter Chat interface. It is designed for production environments requiring a balance of advanced reasoning, instruction following, and high-quality image generation at scale.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from OpenRouter →

Keep reading

OpenRouter launches GPT-5.5 Pro with inspectable reasoning tokens for agentic workflows

OpenRouter integrated OpenAI's GPT-5.5 and GPT-5.5 Pro into its unified API, featuring a 1.05 million token context window. The Pro variant introduces a dedicated reasoning parameter that allows developers to monitor and preserve the model's internal thinking process during complex, multi-step tasks.

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

OpenAIMar 5

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

OpenAI released GPT-5.4, its most capable frontier model combining advanced reasoning, coding, and agentic workflows. It's the first general-purpose model with native computer-use capabilities, and ChatGPT users can now steer its thinking mid-response to refine outputs without starting over.

ComfyUIMay 30

ComfyUI Adds OpenRouter for Unified Access to Frontier Creative Models

ComfyUI launched an official OpenRouter LLM partner node, enabling direct access to over 20 frontier and open-weight models within its visual orchestration platform. The integration dynamically reconfigures its interface based on model capabilities, allowing creators to swap between vision, reasoning, and web-grounded models without rebuilding workflows.

What is GPT-5.4 Image 2?

How does GPT-5.4 Image 2 generate images via the API?

What is the pricing for GPT-5.4 Image 2 on OpenRouter?

What are the context window and output limits for this model?

Who can use GPT-5.4 Image 2 on OpenRouter?

Keep reading

OpenRouter launches GPT-5.5 Pro with inspectable reasoning tokens for agentic workflows

OpenRouter launches GPT-5.5 Pro with inspectable reasoning tokens for agentic workflows

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

ComfyUI Adds OpenRouter for Unified Access to Frontier Creative Models

ComfyUI Adds OpenRouter for Unified Access to Frontier Creative Models

Keep reading

OpenRouter launches GPT-5.5 Pro with inspectable reasoning tokens for agentic workflows

OpenRouter launches GPT-5.5 Pro with inspectable reasoning tokens for agentic workflows

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

OpenAI Launches GPT-5.4 With Native Computer Use and Mid-Response Steering

ComfyUI Adds OpenRouter for Unified Access to Frontier Creative Models

ComfyUI Adds OpenRouter for Unified Access to Frontier Creative Models