Pomni

How to Upload Voice for Cloning

For a voice clone to sound true to life, clean recordings matter. A few minutes of good speech give the best result.

Which recordings work

Best of all is clean, calm speech without noise or music, around 1–3 minutes in total. Videos, voice messages, dictaphone and digitized recordings all work, as long as only the person's voice can be heard. Several short clean clips are better than one noisy recording.

What to avoid

Background music, several voices at once, echo, and heavy noise noticeably worsen the clone. If you have a choice, pick the cleanest clips, even short ones. Less material of good quality beats more of poor quality.

Where to find the voice

Check home videos, birthday greetings, voice messages in messengers, old tapes, and answering machines. The audio track can be extracted from a video. Ask relatives to share any recordings they have.

Uploading

On the training page, add the audio files to the voice section and start the cloning. The result is used only for this AI copy and isn't shared with anyone. The finished clone voices the image's replies in his recognizable voice.

  • Clean speech, 1–3 minutes.
  • No music, echo, or other voices.
  • Look in videos, messengers, on tapes.
  • The clone is used only for this copy.

Frequently asked questions

How many recordings does the voice need?
A few minutes of clean speech is enough; quality matters more than total length.
The voice is only on a noisy video — will it work?
Pick the cleanest segments if you can; heavy noise and music reduce the clone's fidelity.

Save the story while it is with you

Create a memorial page in a few minutes — gently, beautifully and with respect for your loved ones. Free forever for the text version.

Create a memorial
Pomni editors

We help families gently preserve the memory of their loved ones. The materials are written with respect for the subject of loss and are regularly updated. About · Support resources

Read also