🎙️ MOSS-TTS-PNY

Speaker-conditioned text-to-speech with emotion and energy control. Fine-tuned MOSS-TTS checkpoint featuring character voices.

Model: ZDisket/MOSS-TTS-PNY

Speaker
Emotion
0 1

Max 30s of audio output

Examples