NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit

Paper Title

NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit

Authors

Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda

Abstract

This paper describes the design of NNSVS, an open-source software for neural network-based singing voice synthesis research. NNSVS is inspired by Sinsy, an open-source pioneer in singing voice synthesis research, and provides many additional features such as multi-stream models, autoregressive fundamental frequency models, and neural vocoders. Furthermore, NNSVS provides extensive documentation and numerous scripts to build complete singing voice synthesis systems. Experimental results demonstrate that our best system significantly outperforms our reproduction of Sinsy and other baseline systems. The toolkit is available at https://github.com/nnsvs/nnsvs.
