Introducing new modalities for ElevenAgents Your customers don't just talk or type. They send photos, files, voice notes, and locations, and reach out across channels. Now your agents handle all of it. https://t.co/dx4B4GchPu
ElevenLabs Adds Multimodal Input to ElevenAgents for End-to-End Task Resolution
· Updated
- New input modalities
- Images, PDFs, Audio notes, and more
- Supported channels
- WhatsApp, Web Widget, In-app, and more
- Context management
- Cross-channel persistence
- Availability
- Available now
- Integration options
- WhatsApp docs and widget docs
This update addresses the handoff bottleneck where AI agents previously required human intervention to verify documents. By integrating these senses into ElevenLabs' business workflow templates, companies can automate lifecycles—like the ElevenLabs banking support workflows recently deployed—where proof of address or medical records are mandatory for completion.
You can deploy these capabilities now through the ElevenAgents dashboard for web widgets and WhatsApp. The system preserves context across channels, enabling an agent to start a voice call and transition to WhatsApp to process a signed PDF. These features are available to all users currently building with the platform.
Still wondering? A few quick answers below.


