论文标题

VCC2020的Sinica Sinica语音转换系统

The Academia Sinica Systems of Voice Conversion for VCC2020

论文作者

Peng, Yu-Huai, Hu, Cheng-Hung, Kang, Alexander, Lee, Hung-Shin, Chen, Pin-Yuan, Tsao, Yu, Wang, Hsin-Min

论文摘要

本文介绍了2020年语音转换挑战的两项任务的Sinica Systems,即同一语言(任务1)和跨语言语音转换(任务2)中的语音转换。对于这两个任务,我们使用语音令牌作为TTS输入而不是文本或字符遵循级联的ASR+TTS结构。对于任务1,我们将国际语音字母(IPA)用作TTS模型的输入。对于任务2,我们使用了由矢量定量的变异自动编码器(VQVAE)提取的无监督的语音符号。在评估中,听力测试表明,我们的系统在VCC2020挑战中表现良好。

This paper describes the Academia Sinica systems for the two tasks of Voice Conversion Challenge 2020, namely voice conversion within the same language (Task 1) and cross-lingual voice conversion (Task 2). For both tasks, we followed the cascaded ASR+TTS structure, using phonetic tokens as the TTS input instead of the text or characters. For Task 1, we used the international phonetic alphabet (IPA) as the input of the TTS model. For Task 2, we used unsupervised phonetic symbols extracted by the vector-quantized variational autoencoder (VQVAE). In the evaluation, the listening test showed that our systems performed well in the VCC2020 challenge.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源