论文标题
你看到我看到的吗?自动多媒体内容分析的功能和限制
Do You See What I See? Capabilities and Limits of Automated Multimedia Content Analysis
论文作者
论文摘要
近年来,在线用户生成的内容不断增加,导致了自动化内容分析工具的研究和投资的扩展。在COVID-19大流行期间,对自动内容分析的审查已经加快了,因为社交网络服务对这些工具的依赖更加依赖,因为担心对其适应员工的健康风险来自面对面的工作。同时,全世界都有关于如何改善内容适度的同时保护自由表达和隐私的重要政策辩论。为了推进这些辩论,我们需要了解自动化内容分析工具的潜在作用。 本文解释了分析在线多媒体内容的工具的功能和局限性,并强调了在不考虑其限制的情况下大规模使用这些工具的潜在风险。它重点介绍了两种主要工具类别:匹配模型和计算机预测模型。匹配模型包括加密和感知哈希,它们将用户生成的内容与现有内容和已知内容进行比较。预测模型(包括计算机视觉和计算机试听)是机器学习技术,旨在确定新内容或以前未知内容的特征。
The ever-increasing amount of user-generated content online has led, in recent years, to an expansion in research and investment in automated content analysis tools. Scrutiny of automated content analysis has accelerated during the COVID-19 pandemic, as social networking services have placed a greater reliance on these tools due to concerns about health risks to their moderation staff from in-person work. At the same time, there are important policy debates around the world about how to improve content moderation while protecting free expression and privacy. In order to advance these debates, we need to understand the potential role of automated content analysis tools. This paper explains the capabilities and limitations of tools for analyzing online multimedia content and highlights the potential risks of using these tools at scale without accounting for their limitations. It focuses on two main categories of tools: matching models and computer prediction models. Matching models include cryptographic and perceptual hashing, which compare user-generated content with existing and known content. Predictive models (including computer vision and computer audition) are machine learning techniques that aim to identify characteristics of new or previously unknown content.