Avatar V API is live at $0.05/sec The highest-quality AI avatar model for developers Benchmarked against Veo 3.1, Kling O3 Pro, OmniHuman 1.5, and Seedance 2.0 on cross-scene talking-head generation Avatar V won every category Research report + API ↓ https://t.co/WDbK1MeAT0
HeyGen Launches Avatar V API for High Fidelity Programmatic Video
HeyGen· Updated
HeyGen released the API for its Avatar V rendering engine, allowing developers to generate high-fidelity talking-head videos at $0.05 per second. The model uses cross-reference-driven animation to achieve more natural lip-sync and body motion than previous generations.
- Pricing
- $0.05 per second
- API version
- v3
- Default engine
- Avatar IV
- Animation method
- Cross-reference driven
- Supported types
- Digital twins, studio avatars, and more
The release shifts HeyGen's focus toward programmatic scale. By benchmarking Avatar V against frontier models like Google's Veo 3.1 and Kling O3 Pro, the company positions its specialized architecture as the superior choice for professional presenters. This move complements the recently released HeyGen CLI for automated production.
You can access the engine via the v3/videos endpoint by setting the engine parameter to avatar_v. Usage is priced at $0.05 per second of generated video. Developers must first check a digital twin's eligibility through the API, as the engine does not yet support arbitrary image inputs.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →
