Soul-AILab/SoulX-Podcast-1.7B-dialect
Text-to-Speech • 2B • Updated • 165 • 24
Generate custom portraits preserving your face identity
Replace objects in images using prompts or reference images
Generate speech in a chosen voice from a short audio sample
Generate music from text descriptions and optional melodies
Transcribe speech from audio or YouTube videos into text
Transcribe audio files to text instantly