论文标题
在社交网络上检测进攻语言:基于图形注意网络的端到端检测方法
Detecting Offensive Language on Social Networks: An End-to-end Detection Method based on Graph Attention Networks
论文作者
论文摘要
进攻语言在社交网络上的普遍性对社会产生了不利影响,例如在线虐待行为。迫切需要检测进攻性语言并遏制其传播。现有研究表明,具有社区结构特征的方法有效地改善了进攻性语言检测的性能。但是,现有模型独立处理社区结构,这严重影响了检测模型的有效性。在本文中,我们提出了一种基于社区结构和进攻性语言检测的文本特征(CT GOLD)的端到端方法。具体而言,社区结构特征是图形注意力网络层直接捕获的,并且文本嵌入是从Bert的最后一个隐藏层中获取的。注意机制和位置编码用于融合这些功能。同时,我们将用户意见添加到代表用户功能的社区结构中。用户意见由用户历史行为信息表示,该信息的表现优于文本信息表示。除上述点外,在流行的数据集中,用户和推文的分布是不平衡的,这限制了模型的概括能力。要解决此问题,我们构建并发布具有合理用户分布的数据集。我们的方法的表现优于基准,F1得分为89.94%。结果表明,端到端模型有效地了解了社区结构和文本的潜在信息,并且用户历史行为信息更适合用户意见表示。
The pervasiveness of offensive language on the social network has caused adverse effects on society, such as abusive behavior online. It is urgent to detect offensive language and curb its spread. Existing research shows that methods with community structure features effectively improve the performance of offensive language detection. However, the existing models deal with community structure independently, which seriously affects the effectiveness of detection models. In this paper, we propose an end-to-end method based on community structure and text features for offensive language detection (CT-OLD). Specifically, the community structure features are directly captured by the graph attention network layer, and the text embeddings are taken from the last hidden layer of BERT. Attention mechanisms and position encoding are used to fuse these features. Meanwhile, we add user opinion to the community structure for representing user features. The user opinion is represented by user historical behavior information, which outperforms that represented by text information. Besides the above point, the distribution of users and tweets is unbalanced in the popular datasets, which limits the generalization ability of the model. To address this issue, we construct and release a dataset with reasonable user distribution. Our method outperforms baselines with the F1 score of 89.94%. The results show that the end-to-end model effectively learns the potential information of community structure and text, and user historical behavior information is more suitable for user opinion representation.