论文标题

ICML 2022的会议论文集表达性发声研讨会和竞争:认识,产生和个性化声音爆发

Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

论文作者

Baird, Alice, Tzirakis, Panagiotis, Gidel, Gauthier, Jiralerspong, Marco, Muller, Eilif B., Mathewson, Kory, Schuller, Björn, Cambria, Erik, Keltner, Dacher, Cowen, Alan

论文摘要

这是ICML表达发声(EXVO)竞赛的会议记录。 EXVO竞争的重点是理解和产生声乐爆发:笑声,喘息,哭泣和其他非语言发声,这是情感表达和交流的核心。 EXVO 2022,包括三个竞赛曲目,使用1,702位扬声器的59,201个发声的大规模数据集。首先是Exvo-Multitask,要求参与者训练多任务模型,以识别声音爆发中表达的情绪和人口特征。第二个是exvo生成的,要求参与者训练一种产生声音爆发的生成模型,传达了十种不同的情绪。第三个exvo-fewshot要求参与者利用少量的学习融合说话者身份来训练模型,以识别声音爆发传达的10种情感。

This is the Proceedings of the ICML Expressive Vocalization (ExVo) Competition. The ExVo competition focuses on understanding and generating vocal bursts: laughs, gasps, cries, and other non-verbal vocalizations that are central to emotional expression and communication. ExVo 2022, included three competition tracks using a large-scale dataset of 59,201 vocalizations from 1,702 speakers. The first, ExVo-MultiTask, requires participants to train a multi-task model to recognize expressed emotions and demographic traits from vocal bursts. The second, ExVo-Generate, requires participants to train a generative model that produces vocal bursts conveying ten different emotions. The third, ExVo-FewShot, requires participants to leverage few-shot learning incorporating speaker identity to train a model for the recognition of 10 emotions conveyed by vocal bursts.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源