Voice Wrangling

Text-to-Speech, Automatic Subtitles, ...

Text-to-Speech

Slow but best quality and open-source:

Fast and open-source:

Proprietary:

Real-time Voice Morphing

Voice Cloning

Speech Recognition

Whisper (also see Show and Tell)

Online Services:

Diarization (distinguishing speakers in a conversation):

Noise Removal

Tools

Convert English Graphemes to Phonemes