Vercel AI CLI v0.3.0 Adds Multi-Image Inputs for Terminal Workflows

Vercel

May 31, 2026 · Updated Jun 12, 2026

Vercel released version 0.3.0 of its AI CLI, introducing support for multiple reference image inputs and vision-capable stdin detection. The update allows users and AI agents to perform complex visual tasks like style transfer and product referencing directly from the command line.

Vercel released version 0.3.0 of its AI CLI, a terminal tool for generating text, images, and video. The update adds multi-image input support via the -i flag and automatic detection for images piped through stdin (the standard input stream for terminal data). This enables multimodal engineering directly in the terminal.

Version: 0.3.0
New Flag: -i / --image
Input Support: Multi-image and stdin
Node.js Requirement: 20 or higher
Install Command: npm install -g ai-cli

Referencing multiple files allows for precise workflows like style transfer and visual comparisons. By providing a programmable skill for coding agents to analyze or generate assets without leaving the command line, the tool fills a gap in agentic workflows. It allows agents to review UI screenshots or generate product assets as discrete, automated steps.

You can now combine subject images with style references or use sketches to guide product generation. The tool supports hundreds of models via the Vercel AI Gateway with inline previews for compatible terminals. Version 0.3.0 is available now via npm install -g ai-cli and requires Node.js 20 or higher.

View the full update on github.com

Chris Tate

@ctatedevMay 31

Now available in AI CLI Multi-image inputs → style transfer → product references → before / after comparisons Install: npm install -g ai-cli Example: ai image -i map.png -i grid.png -o map+grid.png "overlay grid on map" https://t.co/6BoEi53OA5

View on X

Still wondering? A few quick answers below.

Vercel AI CLI is a lightweight, terminal-based tool designed for generating text, images, and video. It uses the Vercel AI SDK and AI Gateway to provide a unified interface for hundreds of different AI models, allowing both humans and autonomous agents to trigger generations using standard command-line patterns.

In version 0.3.0, you can use the -i or --image flag multiple times in a single command to provide several reference images to a model. This is useful for tasks like style transfer, where you might provide one image for the subject and another for the desired aesthetic.

Yes, the v0.3.0 update introduces automatic vision stdin detection. This means you can pipe image data directly from another command into the AI CLI. For example, you can use a command to output an image and pipe it into the text command to have a vision-capable model describe it.

The tool connects to hundreds of models across various providers through the Vercel AI Gateway. Users can specify models using the -m flag with the creator and model name. If a model does not support specific features like multi-image input, the CLI will return an error from that provider.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Vercel →

Keep reading

Vercel Launches AI CLI to Give Agents Native Multimodal Generation Powers

Vercel released AI CLI, a command-line tool that enables humans and AI agents to generate text, images, and video through a unified interface. By treating multimodal generation as a standard terminal utility, the tool allows agents to pipe outputs between different models to automate complex creative workflows.

Vercel CLI Adds Programmatic Flag Management for AI Agents

Guillermo RauchMar 14

Vercel CLI Adds Programmatic Flag Management for AI Agents

Vercel now lets you create and manage feature flags directly from the terminal using the new vercel flags CLI. The Flags SDK skill enables coding agents to generate flags through natural language prompts without accessing the dashboard.

What is Vercel AI CLI?

How do I use multiple images in AI CLI?

Does AI CLI support image piping?

Which models work with AI CLI?

Keep reading

Vercel Launches AI CLI to Give Agents Native Multimodal Generation Powers

Vercel Launches AI CLI to Give Agents Native Multimodal Generation Powers

Vercel CLI Adds Programmatic Flag Management for AI Agents

Vercel CLI Adds Programmatic Flag Management for AI Agents

Keep reading

Vercel Launches AI CLI to Give Agents Native Multimodal Generation Powers

Vercel Launches AI CLI to Give Agents Native Multimodal Generation Powers

Vercel CLI Adds Programmatic Flag Management for AI Agents

Vercel CLI Adds Programmatic Flag Management for AI Agents