Paper Title

Audio Similarity is Unreliable as a Proxy for Audio Quality

Authors

Pranay Manocha, Zeyu Jin, Adam Finkelstein

Abstract

Many audio processing tasks require perceptual assessment. However, the time and expense of obtaining "gold standard" human judgments limit the availability of such data. Most applications incorporate full-reference or other similarity-based metrics (e.g., PESQ) that depend on a clean reference. Researchers have relied on such metrics to evaluate and compare various proposed methods, often concluding that small, measured differences imply one is more effective than another. This paper demonstrates several practical scenarios where similarity metrics fail to agree with human perception, because they: (1) vary with clean references; (2) rely on attributes that humans factor out when considering quality; and (3) are sensitive to imperceptible signal level differences. In those scenarios, we show that no-reference metrics do not suffer from such shortcomings and correlate better with human perception. We conclude therefore that similarity serves as an unreliable proxy for audio quality.
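
Point (3) can be illustrated with a minimal sketch. It is not the paper's experiment: plain SNR stands in here for a similarity-based metric (PESQ itself is not computed), the 440 Hz test tone is hypothetical, and the 0.5 dB gain is an assumed example of a level change that listeners would be unlikely to hear as a quality difference. The helper names `snr_db` and `apply_gain_db` are introduced only for this illustration.

```python
import math


def snr_db(reference, estimate):
    """SNR in dB, treating (estimate - reference) as noise."""
    signal_power = sum(r * r for r in reference)
    noise_power = sum((e - r) ** 2 for r, e in zip(reference, estimate))
    if noise_power == 0.0:
        return math.inf  # identical signals: no measured difference
    return 10.0 * math.log10(signal_power / noise_power)


def apply_gain_db(signal, gain_db):
    """Scale a signal by a gain given in decibels."""
    scale = 10.0 ** (gain_db / 20.0)
    return [scale * s for s in signal]


# Hypothetical clean reference: one second of a 440 Hz tone at 16 kHz.
sample_rate = 16000
reference = [
    0.5 * math.sin(2.0 * math.pi * 440.0 * n / sample_rate)
    for n in range(sample_rate)
]

# Same content, slightly louder (assumed to be perceptually equivalent).
louder = apply_gain_db(reference, 0.5)

print(f"SNR of an identical copy: {snr_db(reference, reference)} dB")
print(f"SNR after a 0.5 dB gain change: {snr_db(reference, louder):.2f} dB")
```

In this sketch the louder copy contains no added noise or distortion, yet the similarity score drops from infinite to a finite value that depends only on the gain. A metric of this kind would therefore rank the two signals differently even though their content is identical.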
