Your agent's voice is their sonic identity. Choose from an extensive library of professional voices, or create a custom voice from a recording.

Three options available

1. Library of existing voices

Access a wide selection of high-quality pre-trained voices via ElevenLabs and Cartesia. Each voice is available in several tones: formal, casual, warm, authoritative…

2. Custom Voice Cloning

Create a synthetic voice that sounds like a real person speaking. The cloned voice can be used in Pipeline and Dualplex modes.

SupplierRequirements for cloning
CartesiaA single audio file, minimum 10 seconds, one speaker only, with no background noise
ElevenLabsSeveral samples, more than a minute in total, a single speaker, with no background noise

3. Cartesia Voice Sonic 3 (New)

The Cartesia Sonic 3 TTS engine delivers high-fidelity audio quality with advanced emotion control. It supports voice cloning and SAML tags to adjust pitch, intensity, and expressiveness in real time.

Advanced voice settings

SettingBeachEffect
Temperature0.0 – 1.0Lower pitch = stable but less expressive voice. Higher pitch = more dynamic and creative voice.
Silence period before hanging up30-45 secWaiting time if the other party does not answer before ending the call.
Maximum call duration20 – 1200 secAbsolute limit on the duration of a call to control costs.