Skip to main content
A voice defines how your avatar sounds. LiveAvatar pairs real-time lip-synced video with natural-sounding speech, and the voice you choose shapes the entire audio experience — tone, accent, pacing, and personality.
Voices are used in FULL mode only. In LITE mode, you bring your own audio pipeline, so voice selection and configuration are handled on your side.

What voices are available

Voice library

LiveAvatar includes a library of voices spanning different languages, genders, and speaking styles. You can preview any voice before committing to it, making it easy to audition options and find the right fit.

Avatar-generated voices

When you create a custom avatar from video footage, LiveAvatar also generates a voice from that footage. This gives your avatar a voice that naturally matches its appearance — no extra setup needed.

Bring your own voice

In addition to the above, you can import voices from third-party TTS providers. This opens the door to custom-cloned voices, branded voices, or any voice available in a provider’s catalog. See Custom TTS Integration for a step-by-step guide on adding your own voice.

How to use a voice in a LiveAvatar session

LiveAvatar gives you control over how your avatar sounds when starting a session. Every avatar comes with a default voice, but you can override it with any available voice. Once you have selected a voice, you can fine-tune how it sounds at session time. The available controls depend on the underlying provider — for example, you can adjust speed, stability, style, and more. These settings are optional and sensible defaults are applied if you skip them. For a full breakdown of available controls by provider, see the Voice Settings page.