AI talent is here. Meet Mato 1.0Read the launch
DocsUpdated May 1, 2026

Configure voice and delivery

Set a voice, accent, speed, and delivery style on a talent profile so episodes sound exactly how you want.

6 min read
1055 words
voice
ElevenLabs
speech

Every talent in Mato carries its own voice configuration. The settings you choose here control how the talent sounds during episode audio rendering, and they apply to every podcast the talent is assigned to (unless overridden at the podcast level).

Open the voice identity card

Go to Talent in the sidebar, then click the talent you want to configure. Scroll down past the personality section until you see the Voice Identity card on the right column.

Voice Identity card on a talent profile

This card holds all voice and delivery settings for the talent. Changes here are saved independently from the personality editor, so you can update voice settings without touching the profile content.

Choose a voice

Mato supports two voice providers. The one you pick determines which voice library appears.

Gemini voices

The Gemini voice selector is the primary voice picker. Open the dropdown, search by name, gender, or style, and select a voice. Each entry shows the voice name, gender, and descriptive tags.

Click the speaker icon next to any voice to hear a short preview before committing. The preview generates a sample clip on the fly, so it may take a second to load.

Gemini voice selector with preview controls

Mato includes 29 Gemini voices (14 female, 15 male), each with a display name and style tags such as "warm", "expressive", or "conversational".

ElevenLabs voices (admin only)

The ElevenLabs voice selector appears only for super admins. It pulls voices from your connected ElevenLabs account. If you see only the Gemini selector, your account does not have ElevenLabs access enabled.

When both are configured, Mato uses the ElevenLabs voice if a voice ID is set, and falls back to the Gemini voice otherwise.

Set the accent

The accent selector offers common English accent presets: American, British, Australian, Canadian, Indian, Irish, Scottish, South African, and New Zealand.

Pick one from the dropdown, or type a custom accent in the search field if the presets do not cover what you need.

Adjust speech speed

The speech speed dropdown controls pacing during audio rendering. Four presets are available:

  • Slow sets a deliberate, measured pace with extra time between phrases.
  • Normal keeps the default conversational tempo.
  • Fast adds forward momentum with brisk delivery.
  • Very Fast creates rapid, high-energy output with minimal pauses.

Speed presets inject pacing directives into the TTS prompt that override other pacing guidance, so what you select here always takes priority.

Write an audio profile

The audio profile is a free-text field where you describe how the talent should sound in the final mix. Think of it as a note to the audio engine about the talent's overall sonic character.

Example: "Warm and intimate, as if speaking to a friend across a coffee table. Slight vocal fry on lower register."

Add director notes

Director notes guide tone, pacing, and delivery in more specific terms than the audio profile. These notes are included in the TTS prompt that controls how the voice renders each line.

Example: "Lead with curiosity, not authority. Pause briefly before key statistics. Avoid sounding rushed during transitions between segments."

Director notes and speech speed work together. If your notes mention pacing and you also set a speed preset, the speed preset takes priority for tempo, while the director notes still influence other aspects of delivery like emotion and emphasis.

Save your changes

After adjusting any voice settings, click Save at the bottom of the Voice Identity card. A confirmation toast appears on success. The page refreshes to reflect your saved values.

The save button stays disabled until you select at least one voice (Gemini or ElevenLabs).

Talent also needs a saved voice before it can be assigned to a podcast. If you try to add an unvoiced talent from a podcast's Talent tab, Mato shows a warning and keeps the assign action disabled.

Preview with audio samples

Audio samples let you hear how a talent sounds before using it in a real episode. You generate samples from the talent edit page, not from the Voice Identity card directly.

  1. Click Edit on the talent profile.
  2. Scroll to the Audio Samples section.
  3. Click Generate Audio Samples (requires a Gemini voice to be assigned first).
  4. Wait for the samples to appear. Each clip shows its label and duration, with a built-in audio player.

Audio samples section on the talent edit page

If samples already exist, the button changes to Regenerate Audio Samples. Regenerating replaces the previous set.

Back on the talent detail page, the Audio Samples card shows the generated clips so you can listen without entering the editor.

Per-podcast talent settings

When you assign a talent to a podcast, the podcast uses the talent's default voice settings. You can review the current assignment from the podcast settings page.

Go to Podcasts, open your podcast, click Settings, then select the Talent tab. This tab shows each assigned talent with their role, position, accent, and speech speed.

Talent tab in podcast settings showing assigned hosts

The assignment row shows the effective accent and speech speed for that podcast. In the normal UI, those values come from the talent's voice settings unless your workspace has assignment-specific values stored from an admin or migration workflow.

Click Manage voice on the assignment row to return to the talent profile and edit the talent's base voice settings. Changes saved on the Voice Identity card are applied to that talent and its linked podcast host records.

Quick reference

SettingWhere to find itWhat it controls
Gemini voiceVoice Identity cardWhich Gemini TTS voice renders audio
ElevenLabs voiceVoice Identity card (admin only)Which ElevenLabs voice renders audio
AccentVoice Identity cardEnglish accent variant
Speech speedVoice Identity cardPacing tempo (slow, normal, fast, very fast)
Audio profileVoice Identity cardFree-text sonic character description
Director notesVoice Identity cardFree-text delivery and tone guidance
Audio samplesTalent edit pagePreview clips to test voice configuration
Assignment reviewPodcast settings, Talent tabAssigned talent, role, position, and effective accent or speed

Still need help?

If the doc didn't solve it, open Intercom and ask us directly. Include the page you're on and what you expected to happen.

© 2026 Mato. All rights reserved. English · Multiple languages available