听他们说的话：更好地理解和检测用户反馈的在线错误信息

论文标题

听他们说的话：更好地理解和检测用户反馈的在线错误信息

Listen to what they say: Better understand and detect online misinformation with user feedback

论文作者

Etienne, Hubert, Çelebi, Onur

论文摘要

报告内容的社交媒体用户是在线错误信息管理中的关键盟友，但是，尚未进行研究以了解其角色和报告活动的不同趋势。我们提出了一种研究错误信息的原始方法：从报告用户的角度检查内容级别，并且在跨区域和平台之间进行了研究。我们提出了由c的评论引起的报告内容文章的第一个分类。 2020年6月，在法国，英国和美国的Facebook和Instagram上报告的9,000件项目。这使我们能够观察到有关国家和平台之间报告内容的有意义的区别，因为它的数量，类型，主题和操纵技术在量中有很大变化。在研究其中的六种技术时，我们确定了一种针对Instagram的新颖技术，比其他技术更复杂，可能对算法检测和人类节制提出了具体的挑战。我们还确定了四种报告行为，从中我们得出了四种类型的噪声，能够解释据报道为错误信息的内容中发现的一半不准确性。我们最终表明，将用户报告信号分解为多个行为，允许在一个小型数据集中训练一个简单但具有竞争力的分类器，并结合了基本用户报告的组合，以对不同类型的报告内容进行分类。

Social media users who report content are key allies in the management of online misinformation, however, no research has been conducted yet to understand their role and the different trends underlying their reporting activity. We suggest an original approach to studying misinformation: examining it from the reporting users perspective at the content-level and comparatively across regions and platforms. We propose the first classification of reported content pieces, resulting from a review of c. 9,000 items reported on Facebook and Instagram in France, the UK, and the US in June 2020. This allows us to observe meaningful distinctions regarding reporting content between countries and platforms as it significantly varies in volume, type, topic, and manipulation technique. Examining six of these techniques, we identify a novel one that is specific to Instagram US and significantly more sophisticated than others, potentially presenting a concrete challenge for algorithmic detection and human moderation. We also identify four reporting behaviours, from which we derive four types of noise capable of explaining half of the inaccuracy found in content reported as misinformation. We finally show that breaking down the user reporting signal into a plurality of behaviours allows to train a simple, although competitive, classifier on a small dataset with a combination of basic users-reports to classify the different types of reported content pieces.

下载PDF全文

下载文献需遵守相关版权规定

论文标题