主题和深层变异模型的可解释的假新闻检测

论文标题

主题和深层变异模型的可解释的假新闻检测

Interpretable Fake News Detection with Topic and Deep Variational Models

论文作者

Hosseini, Marjan, Sabet, Alireza Javadian, He, Suining, Aguiar, Derek

论文摘要

社会对社交媒体和用户为新闻和信息产生的内容的依赖不断增长，这增加了不可靠的资源和虚假内容的影响，这使公众的讨论融为一体，并减少了对媒体的信任。验证此类信息的可信度是一项艰巨的任务，容易受到确认偏见的影响，从而开发了算法技术以区分假新闻和真实新闻。但是，大多数现有的方法都具有挑战性的解释，使得难以建立对预测的信任，并在许多现实世界中（例如，视听特征或出处的可用性）做出不现实的假设。在这项工作中，我们专注于使用可解释的功能和方法对文本内容的假新闻检测。特别是，我们开发了一个深层的概率模型，该模型使用各种自动编码器和双向长期记忆（LSTM）网络（LSTM）网络与语义主题相关的特征从贝叶斯混合模型推论。使用3个现实世界数据集的广泛的实验研究表明，我们的模型可与最先进的竞争模型相媲美，同时促进了从学习的主题中解释模型。最后，我们进行了模型消融研究，以证明整合神经嵌入和主题特征的有效性和准确性是通过在较低维嵌入中可分离性评估性能和质量性来定量的。

The growing societal dependence on social media and user generated content for news and information has increased the influence of unreliable sources and fake content, which muddles public discourse and lessens trust in the media. Validating the credibility of such information is a difficult task that is susceptible to confirmation bias, leading to the development of algorithmic techniques to distinguish between fake and real news. However, most existing methods are challenging to interpret, making it difficult to establish trust in predictions, and make assumptions that are unrealistic in many real-world scenarios, e.g., the availability of audiovisual features or provenance. In this work, we focus on fake news detection of textual content using interpretable features and methods. In particular, we have developed a deep probabilistic model that integrates a dense representation of textual news using a variational autoencoder and bi-directional Long Short-Term Memory (LSTM) networks with semantic topic-related features inferred from a Bayesian admixture model. Extensive experimental studies with 3 real-world datasets demonstrate that our model achieves comparable performance to state-of-the-art competing models while facilitating model interpretability from the learned topics. Finally, we have conducted model ablation studies to justify the effectiveness and accuracy of integrating neural embeddings and topic features both quantitatively by evaluating performance and qualitatively through separability in lower dimensional embeddings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题