How to use Fhrozen/tts_prodiff_eng_mspk with ESPnet:
from espnet2.bin.tts_inference import Text2Speech model = Text2Speech.from_pretrained("Fhrozen/tts_prodiff_eng_mspk") speech, *_ = model("text to generate speech from")
No support given.
num_iters_per_epoch: 250 max_epoch: 800 batch_bins: 8000000 tts_conf: spk_embed_dim: 192