通过对抗性学习检测微博客帖子中低资源领域的谣言

论文标题

通过对抗性学习检测微博客帖子中低资源领域的谣言

Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning

论文作者

Lin, Hongzhan, Ma, Jing, Chen, Liangliang, Yang, Zhiwei, Cheng, Mingfei, Chen, Guang

论文摘要

大量的虚假谣言随着突发新闻或趋势主题的出现，严重阻碍了事实。现有的谣言检测方法在昨天的新闻上取得了令人鼓舞的表现，因为从同一领域收集了足够的语料库来进行模型培训。但是，由于缺乏培训数据和先验知识（即低资源制度），他们在发现有关不可预见事件的谣言方面很差。在本文中，我们提出了一个对抗性对比学习框架，以通过调整从资源丰富的谣言数据中学到的特征来检测谣言，从而探讨了资源低的谣言。我们的模型明确地克服了通过语言一致性和新颖的监督对比培训范式对域和/或语言使用的限制。此外，我们开发了一种对抗性增强机制，以进一步增强低资源谣言的鲁棒性。从现实世界中的微博平台收集的两个低资源数据集进行的广泛实验表明，我们的框架的性能要比最先进的方法更好，并且在早期阶段表现出较高的谣言的能力。

Massive false rumors emerging along with breaking news or trending topics severely hinder the truth. Existing rumor detection approaches achieve promising performance on the yesterday's news, since there is enough corpus collected from the same domain for model training. However, they are poor at detecting rumors about unforeseen events especially those propagated in different languages due to the lack of training data and prior knowledge (i.e., low-resource regimes). In this paper, we propose an adversarial contrastive learning framework to detect rumors by adapting the features learned from well-resourced rumor data to that of the low-resourced. Our model explicitly overcomes the restriction of domain and/or language usage via language alignment and a novel supervised contrastive training paradigm. Moreover, we develop an adversarial augmentation mechanism to further enhance the robustness of low-resource rumor representation. Extensive experiments conducted on two low-resource datasets collected from real-world microblog platforms demonstrate that our framework achieves much better performance than state-of-the-art methods and exhibits a superior capacity for detecting rumors at early stages.

下载PDF全文

下载文献需遵守相关版权规定

论文标题