Cohere Releases Open Source Transcribe Model Outperforming Whisper on Accuracy

Cohere

Mar 28, 2026 · Updated Apr 25, 2026

Cohere launched Transcribe, a 2-billion parameter open-source speech recognition model that currently holds the top spot on the HuggingFace Open ASR Leaderboard. By achieving a 5.42% word error rate, it provides a high-accuracy, high-throughput alternative for enterprise workflows that previously relied on larger or proprietary models.

Cohere released Cohere Transcribe, an open-source speech recognition model under the Apache 2.0 license. This 2-billion parameter model uses a Conformer-based architecture to transcribe 14 languages. It currently ranks first for English accuracy on the HuggingFace Open ASR Leaderboard, outperforming Whisper Large v3 and ElevenLabs Scribe v2.

High-performance transcription usually requires massive models or expensive closed APIs. This model shifts the trade-off between accuracy and cost by delivering a superior accuracy-to-speed ratio in a smaller footprint. It enables state-of-the-art transcription on consumer-grade GPUs or at the edge without the high error rates typical of lightweight models.

Download the weights for cohere-transcribe-03-2026 for local deployment or use the API for experimentation. For production needs, the model is available through Model Vault. Future updates will integrate the model into North, the company’s agent orchestration platform, to power voice-enabled enterprise agents and real-time support workflows.

View the full update on cohere.com

Cohere

@cohereMar 26

Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition. https://t.co/l87Z6oyQdM

264

View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Cohere →

Keep reading

Cohere Launches Command A+ to Bring Frontier Agentic AI to Private Hardware

Cohere released Command A+, a 218-billion parameter open-source model optimized for complex reasoning and multimodal agentic tasks. By achieving high performance on as little as two H100 GPUs, the model allows enterprises to deploy frontier-class agents entirely within their own private infrastructure.

Cohere Releases North Mini Code, a Small Open-Weight Model for Coding

Artificial AnalysisJun 10

Cohere Releases North Mini Code, a Small Open-Weight Model for Coding

Cohere released North Mini Code, a small 30B parameter (3B active) open weights coding model. This model achieves competitive coding performance for its size and speed, positioning it as a focused option in the open-weight ecosystem.

Mistral AIMar 28

Mistral AI Launches Voxtral TTS to Challenge Proprietary Models with Open Weights

Mistral AI launched Voxtral TTS, a 4B-parameter text-to-speech model capable of zero-shot voice cloning from just three seconds of audio. By offering frontier-grade emotional expressiveness and low latency in an open-weight format, it provides a high-performance alternative to closed-source providers for building real-time voice agents.

Together AI Launches Unified Voice Agent Cloud With Full Pipeline Co-Location

Together AIMar 18

Together AI Launches Unified Voice Agent Cloud With Full Pipeline Co-Location

Together AI launched a unified platform for real-time voice agents with STT, LLM, and TTS co-located on one cloud. Most voice stacks route audio across separate vendors — Together keeps all three in the same cluster, hitting latency under 700ms.