论文标题

场景捕获:场景假音频检测的初始数据集和基准

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

论文作者

Yi, Jiangyan, Wang, Chenglong, Tao, Jianhua, Zhang, Chu Yuan, Fan, Cunhang, Tian, Zhengkun, Ma, Haoxin, Fu, Ruibo

论文摘要

许多数据集旨在进一步开发假音频检测。但是,以前数据集中的虚假话语主要是通过更改音色,韵律,语言内容或原始音频的通道噪声而产生的。这些数据集忽略了一个方案,其中原始音频的声学场景被伪造的音频操纵。如果有些人以恶意的目的滥用操纵的音频,这将对我们的社会构成重大威胁。因此,这促使我们填补了空白。本文提出了这样一个名为“ caspefake”的场景的数据集,该数据集名为Faste -Fake,在其中仅通过使用语音增强技术来篡改真实话语的声音场景来生成一个操纵的音频。本文报道了一些场景在场景捕获数据集上的假音频检测基准结果。此外,本文介绍了使用不同语音增强技术和信噪比的虚假攻击分析。结果表明,在ASVSPOOF 2019数据集中训练的基线模型无法可靠地检测到伪造的话语。尽管这些模型在场景摄影训练集和看到测试集中表现良好,但在看不见的测试集中,它们的性能很差。数据集(https://zenodo.org/record/7663324#.y_xkmupyuuk)和基准源代码(https://github.com/addchallenge/scenefake)公开。

Many datasets have been designed to further the development of fake audio detection. However, fake utterances in previous datasets are mostly generated by altering timbre, prosody, linguistic content or channel noise of original audio. These datasets leave out a scenario, in which the acoustic scene of an original audio is manipulated with a forged one. It will pose a major threat to our society if some people misuse the manipulated audio with malicious purpose. Therefore, this motivates us to fill in the gap. This paper proposes such a dataset for scene fake audio detection named SceneFake, where a manipulated audio is generated by only tampering with the acoustic scene of an real utterance by using speech enhancement technologies. Some scene fake audio detection benchmark results on the SceneFake dataset are reported in this paper. In addition, an analysis of fake attacks with different speech enhancement technologies and signal-to-noise ratios are presented in this paper. The results indicate that scene fake utterances cannot be reliably detected by baseline models trained on the ASVspoof 2019 dataset. Although these models perform well on the SceneFake training set and seen testing set, their performance is poor on the unseen test set. The dataset (https://zenodo.org/record/7663324#.Y_XKMuPYuUk) and benchmark source codes (https://github.com/ADDchallenge/SceneFake) are publicly available.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源