论文标题

att-hack:具有社会态度的表达性语音数据库

Att-HACK: An Expressive Speech Database with Social Attitudes

论文作者

Moine, Clément Le, Obin, Nicolas

论文摘要

本文介绍了ATT-Hack,这是第一个具有社会态度的演讲的大型数据库。可用的表达语音数据库很少见,并且通常仅限于主要情绪:愤怒,喜悦,悲伤,恐惧。这极大地限制了表达语音的研究范围。此外,这种数据库中总是忽略和缺少语音韵律的基本方面:它的多样性,即在改变其韵律时重复发言的可能性。本文是通过提供社会态度的行为言论数据库来扩大语音表达范围的首次尝试:友好,诱人,占主导地位和遥远。拟议的数据库包括25位在4种社会态度中解释100个话语的演讲者,每种态度的3-5个重复,总计约30个小时的演讲。根据Creative Commons许可,可以免费获得ATT-HACK的学术研究。

This paper presents Att-HACK, the first large database of acted speech with social attitudes. Available databases of expressive speech are rare and very often restricted to the primary emotions: anger, joy, sadness, fear. This greatly limits the scope of the research on expressive speech. Besides, a fundamental aspect of speech prosody is always ignored and missing from such databases: its variety, i.e. the possibility to repeat an utterance while varying its prosody. This paper represents a first attempt to widen the scope of expressivity in speech, by providing a database of acted speech with social attitudes: friendly, seductive, dominant, and distant. The proposed database comprises 25 speakers interpreting 100 utterances in 4 social attitudes, with 3-5 repetitions each per attitude for a total of around 30 hours of speech. The Att-HACK is freely available for academic research under a Creative Commons Licence.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源