Paper for this work: https://arxiv.org/abs/2205.05227
This is based on our previous work:
@inproceedings{D-DSVAE-VC,
title={Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion},
author={Lian, Jiachen and Zhang, Chunlei and Yu, Dong},
booktitle={IEEE ICASSP},
year={2022},
organization={IEEE}
}
The previous demo is here https://jlian2.github.io/Robust-Voice-Style-Transfer/.
Demo for this work: https://jlian2.github.io/Improved-Voice-Conversion-with-Conditional-DSVAE/.