Generate speech in a chosen voice from a short audio sample
Generate speech from text using a reference voice
Transform and identify speech with MMS
Generate spoken audio from text in multiple languages
Generate a talking face video from an image and audio