Hi @Wendison
Thank you so much for this great work.
I fine-tuned (resumed) pretrained model (use_CSMI=True use_CPMI=True use_PSMI=True) with indicTTS dataset (20 speakers - each having 1 hour audios)
the model trained with 1000 epochs.
Quality gets better for the target speaker. but source speaker modulation is not converted.
Can you please give your suggestions?
Thanks
Hi @Wendison
Thank you so much for this great work.
I fine-tuned (resumed) pretrained model (use_CSMI=True use_CPMI=True use_PSMI=True) with indicTTS dataset (20 speakers - each having 1 hour audios)
the model trained with 1000 epochs.
Quality gets better for the target speaker. but source speaker modulation is not converted.
Can you please give your suggestions?
Thanks