F5-TTS-Emotion-CFG introudces explicit emotion conditioning in F5-TTS zero-shot voice cloning model, by fine-tuning on ESD dataset. The following emotions are supported: Neutral, Happy, Sad, Angry and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results