Paper Title
A Reddit Dataset for the Russo-Ukrainian Conflict in 2022
Paper Authors
Paper Abstract
Reddit consists of sub-communities (subreddits), each covering a focused topic. This paper provides a list of relevant subreddits for the ongoing Russo-Ukrainian crisis. We perform an exhaustive subreddit exploration using keyword search and shortlist 12 subreddits as potential candidates that contain nominal discourse related to the crisis. Collectively, these subreddits contain over 300,000 posts and 8 million comments. We further group them into two categories, "R-U Conflict" and "Military Related", based on their primary focus, and characterize the content of each. The results show a surge of posts and comments soon after Russia launched the invasion. "Military Related" posts are more likely to receive more replies than "R-U Conflict" posts. Our textual analysis shows an apparent preference for the pro-Ukraine stance in the "R-U Conflict" subreddits, while the "Military Related" subreddits retain a neutral stance.