Seven new models launching at Build: let’s go! Reasoning. Code. Image. Transcribe. Voice. Built from scratch on a clean data lineage, designed for efficiency, working seamlessly as a family of models Thread 🧵 #MSBuild https://t.co/g3WQIcIQ24
Microsoft AI launches MAI model family for private enterprise workflow tuning
MAI-Thinking-1 reasoning model matches Claude Sonnet 4.6 in human preference evaluations.
- MAI-Code-1-Flash parameters: 5 billion
- MAI-Code-1-Flash SWE-Bench Verified: 71.6
- MAI-Transcribe-1.5 speed: 1 hour of audio in under 15 seconds
- MAI-Image-2.5-Flash input price: $1.75 per 1M tokens
- MAI-Voice-2 language support: 15 languages
This launch signals a move toward self-sufficiency: Microsoft AI co-designed the models with Maia 200 silicon for a 1.4x efficiency boost. By avoiding distillation, Microsoft AI scales performance via its own compute and data. The MAI-Image-2.5 release already validates this shift, securing a top-three spot on the Arena image leaderboard for text-to-image generation.
Organizations can use Frontier Tuning to adapt models to workflows using reinforcement learning (learning through trial and error) in private environments. MAI-Code-1-Flash is rolling out in GitHub Copilot, while multimodal models are available via Microsoft Foundry. A specialized healthcare model co-created with the Mayo Clinic is also in development for clinical reasoning.
Every HeadsUpAI update is written based on its original source and reviewed before it's published.





