AI SDK Adds OpenAI WebSocket Transport for Up to 40% Faster Time to First Byte

OpenAI

Feb 23, 2026 · Updated Apr 25, 2026

Vercel's AI SDK added OpenAI WebSocket transport that cuts time to first byte by up to 40%. Swap in a WebSocket fetch adapter and streaming responses arrive faster by skipping repeated connection handshakes.

AI SDK, Vercel's open-source toolkit for building AI applications, added WebSocket transport for OpenAI connections. Instead of opening a new HTTP connection for each API call, a WebSocket keeps one persistent connection open, which removes the TCP and TLS handshake overhead from every request after the first and cuts time to first byte by up to 40%.

The gain compounds for apps that make frequent OpenAI calls. Chat interfaces and agent loops pay TCP and TLS handshake costs on every HTTP request, and those costs stack up across a conversation. A persistent WebSocket connection pays that overhead once, so the saving grows with the number of calls.
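To make the compounding concrete, here is a toy latency model. It is only a sketch: the 100 ms handshake and 300 ms server time are illustrative assumptions, not measurements from Vercel or OpenAI, and `totalTtfbMs` is a hypothetical helper name.

```typescript
// Toy latency model. All numbers are illustrative assumptions, not measurements.
interface LatencyModel {
  handshakeMs: number; // assumed TCP + TLS setup cost for each new connection
  serverMs: number;    // assumed time for the server to produce the first byte
}

// Sum of time to first byte over `calls` sequential requests.
function totalTtfbMs(model: LatencyModel, calls: number, persistent: boolean): number {
  // A persistent connection pays the handshake once; otherwise every call pays it.
  const handshakes = persistent ? Math.min(calls, 1) : calls;
  return handshakes * model.handshakeMs + calls * model.serverMs;
}

const model: LatencyModel = { handshakeMs: 100, serverMs: 300 };

const perRequest = totalTtfbMs(model, 10, false); // 10 * 100 + 10 * 300 = 4000
const reused = totalTtfbMs(model, 10, true);      //  1 * 100 + 10 * 300 = 3100
const savedMs = perRequest - reused;              // 900
```

Under these assumed numbers, a ten-call conversation saves 900 ms in total, and each additional call widens the gap by one more handshake.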

Drop the WebSocket transport into an existing AI SDK project: streamText, message handling, and response formatting stay unchanged, and the persistent connection delivers the latency improvement.
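The sketch below illustrates why a fetch-level swap leaves application code untouched. It does not use the AI SDK itself: `FetchLike`, `createHttpFetch`, `createPersistentFetch`, `createClient`, and `runConversation` are all hypothetical stand-ins, and the "handshake" is simulated by a log entry rather than a real network connection. In a real project the adapter would presumably be passed through the custom `fetch` option that the `@ai-sdk/openai` provider factory accepts. The article does not name the adapter package, so none is shown here.

```typescript
// Stand-in types: this sketch does not import the AI SDK or open any sockets.
type FetchLike = (url: string, init?: { body?: string }) => Promise<string>;

// Hypothetical per-request transport: simulates a new handshake on every call.
function createHttpFetch(log: string[]): FetchLike {
  return async (_url, init) => {
    log.push("handshake");
    return `echo:${init?.body ?? ""}`;
  };
}

// Hypothetical persistent transport: simulates one handshake, then reuses it.
function createPersistentFetch(log: string[]): FetchLike {
  let connected = false;
  return async (_url, init) => {
    if (!connected) {
      log.push("handshake");
      connected = true;
    }
    return `echo:${init?.body ?? ""}`;
  };
}

// Hypothetical client factory. The transport is its only configurable part.
function createClient(options: { fetch: FetchLike }) {
  return {
    complete: (prompt: string) =>
      options.fetch("https://example.invalid/v1/responses", { body: prompt }),
  };
}

// Application code: identical no matter which transport is injected.
async function runConversation(fetchImpl: FetchLike): Promise<string[]> {
  const client = createClient({ fetch: fetchImpl });
  const replies: string[] = [];
  for (const prompt of ["hi", "and then?", "thanks"]) {
    replies.push(await client.complete(prompt));
  }
  return replies;
}
```

Calling `runConversation` with either transport returns the same replies. Only the number of simulated handshakes differs: three for the per-request transport and one for the persistent transport across the three prompts.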

View the full update on ai-sdk.dev

AI SDK

@aisdk · Feb 23

Use OpenAI WebSockets with the AI SDK to save up to 40% time to first byte. https://t.co/7gNm7ut1YK

