Fast, multi-speaker TTS (44.1kHz) with voice cloning
Efficient, fast, and natural text to speech with StyleTTS 2!