ICML 2022的会议论文集表达性发声研讨会和竞争：认识，产生和个性化声音爆发

论文标题

ICML 2022的会议论文集表达性发声研讨会和竞争：认识，产生和个性化声音爆发

Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

论文作者

Baird, Alice, Tzirakis, Panagiotis, Gidel, Gauthier, Jiralerspong, Marco, Muller, Eilif B., Mathewson, Kory, Schuller, Björn, Cambria, Erik, Keltner, Dacher, Cowen, Alan

论文摘要

这是ICML表达发声（EXVO）竞赛的会议记录。 EXVO竞争的重点是理解和产生声乐爆发：笑声，喘息，哭泣和其他非语言发声，这是情感表达和交流的核心。 EXVO 2022，包括三个竞赛曲目，使用1,702位扬声器的59,201个发声的大规模数据集。首先是Exvo-Multitask，要求参与者训练多任务模型，以识别声音爆发中表达的情绪和人口特征。第二个是exvo生成的，要求参与者训练一种产生声音爆发的生成模型，传达了十种不同的情绪。第三个exvo-fewshot要求参与者利用少量的学习融合说话者身份来训练模型，以识别声音爆发传达的10种情感。

This is the Proceedings of the ICML Expressive Vocalization (ExVo) Competition. The ExVo competition focuses on understanding and generating vocal bursts: laughs, gasps, cries, and other non-verbal vocalizations that are central to emotional expression and communication. ExVo 2022, included three competition tracks using a large-scale dataset of 59,201 vocalizations from 1,702 speakers. The first, ExVo-MultiTask, requires participants to train a multi-task model to recognize expressed emotions and demographic traits from vocal bursts. The second, ExVo-Generate, requires participants to train a generative model that produces vocal bursts conveying ten different emotions. The third, ExVo-FewShot, requires participants to leverage few-shot learning incorporating speaker identity to train a model for the recognition of 10 emotions conveyed by vocal bursts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题