论文标题
国际象棋讨论是种族主义者吗?对抗性仇恨言论数据集
Are Chess Discussions Racist? An Adversarial Hate Speech Data Set
论文作者
论文摘要
2020年6月28日,安东尼奥·拉迪奇(AntonioRadić)的YouTube手柄在大师hikaru Nakamura上展示了国际象棋播客时,因为它包含“有害和危险”的内容。 YouTube没有给出进一步的特定原因,并且通道在24小时内恢复了。但是,拉迪奇推测,鉴于当前的政治局势,即“黑人对白人”的转介,尽管在国际象棋的背景下为他赢得了这一临时禁令。 In this paper, via a substantial corpus of 681,995 comments, on 8,818 YouTube videos hosted by five highly popular chess-focused YouTube channels, we ask the following research question: \emph{how robust are off-the-shelf hate-speech classifiers to out-of-domain adversarial examples?} We release a data set of 1,000 annotated comments where existing hate speech classifiers misclassified benign chess讨论是仇恨言论。我们的发现以一个有趣的类比结果指出,我们的发现指出了更广泛的颜色多义挑战。
On June 28, 2020, while presenting a chess podcast on Grandmaster Hikaru Nakamura, Antonio Radić's YouTube handle got blocked because it contained "harmful and dangerous" content. YouTube did not give further specific reason, and the channel got reinstated within 24 hours. However, Radić speculated that given the current political situation, a referral to "black against white", albeit in the context of chess, earned him this temporary ban. In this paper, via a substantial corpus of 681,995 comments, on 8,818 YouTube videos hosted by five highly popular chess-focused YouTube channels, we ask the following research question: \emph{how robust are off-the-shelf hate-speech classifiers to out-of-domain adversarial examples?} We release a data set of 1,000 annotated comments where existing hate speech classifiers misclassified benign chess discussions as hate speech. We conclude with an intriguing analogy result on racial bias with our findings pointing out to the broader challenge of color polysemy.