解决实时歌剧跟踪中的朗诵问题

论文标题

解决实时歌剧跟踪中的朗诵问题

Addressing the Recitative Problem in Real-time Opera Tracking

论文作者

Brazier, Charles, Widmer, Gerhard

论文摘要

强大的实时歌剧跟踪（SCOREST）对于围绕现场歌剧舞台和流媒体的许多过程（包括自动歌词显示，摄像头控制或实时视频切割）非常有用。最近的工作表明，有了一些适当的措施来解决常见问题，例如中断和中断，自发的掌声，各种噪音和插曲，可以以相对强大的方式从头到尾遵循整个歌剧。但是，当文本内容变得突出反对旋律或音乐时，它们仍然不准确，尤其是在Recitativo段落中。在本文中，我们提议并行使用两个专门的跟踪器来解决这个特定问题，一个专注于音乐，另一个是对语音敏感的功能。我们首先就语音相关特征进行了系统的研究，以针对同一歌剧不同表演的相应读物的精确比对。然后，我们根据预先训练的音乐和语音分类器提出了不同的解决方案，以结合两个跟踪器，以提高整个歌剧过程中的全球准确性。

Robust real-time opera tracking (score following) would be extremely useful for many processes surrounding live opera staging and streaming, including automatic lyrics displays, camera control, or live video cutting. Recent work has shown that, with some appropriate measures to account for common problems such as breaks and interruptions, spontaneous applause, various noises and interludes, current audio-to-audio alignment algorithms can be made to follow an entire opera from beginning to end, in a relatively robust way. However, they remain inaccurate when the textual content becomes prominent against the melody or music -- notably, during recitativo passages. In this paper, we address this specific problem by proposing to use two specialized trackers in parallel, one focusing on music-, the other on speech-sensitive features. We first carry out a systematic study on speech-related features, targeting the precise alignment of corresponding recitatives from different performances of the same opera. Then we propose different solutions, based on pre-trained music and speech classifiers, to combine the two trackers in order to improve the global accuracy over the course of the entire opera.

下载PDF全文

下载文献需遵守相关版权规定

论文标题