论文标题
Lightme:在互联网支持小组中分析语言的心理健康
Lightme: Analysing Language in Internet Support Groups for Mental Health
论文作者
论文摘要
背景:协助主持人在互联网支持小组中进行分类有害帖子,以确保其安全使用。自动化文本分类方法分析在线论坛帖子中表达的语言是一个有希望的解决方案。方法:使用自然语言处理和机器学习技术用于使用来自Reachout心理健康论坛的年轻人的数据集构建分类后分类器。结果:与最先进的解决方案进行比较时,该解决方案主要基于词汇资源的特征,获得了危机柱(52%)的最佳分类性能,这是最严重的类别。分析危机邮报时发现了六个显着的语言特征。 1)表达绝望的帖子,2)表达简洁的负面情绪反应的简短帖子,3)表达情绪变化的长篇文章,4)帖子表达对可用健康服务不满意的帖子,5)使用讲故事的帖子和6)帖子表达了在危机期间向同龄人寻求建议的用户。结论:只能使用仅从帖子的文本内容衍生的功能来构建竞争性分类分类器。为了将我们的定量和定性发现转化为特征,需要进行进一步的研究,因为它可以提高整体性能。
Background: Assisting moderators to triage harmful posts in Internet Support Groups is relevant to ensure its safe use. Automated text classification methods analysing the language expressed in posts of online forums is a promising solution. Methods: Natural Language Processing and Machine Learning technologies were used to build a triage post classifier using a dataset from Reachout mental health forum for young people. Results: When comparing with the state-of-the-art, a solution mainly based on features from lexical resources, received the best classification performance for the crisis posts (52%), which is the most severe class. Six salient linguistic characteristics were found when analysing the crisis post; 1) posts expressing hopelessness, 2) short posts expressing concise negative emotional responses, 3) long posts expressing variations of emotions, 4) posts expressing dissatisfaction with available health services, 5) posts utilising storytelling, and 6) posts expressing users seeking advice from peers during a crisis. Conclusion: It is possible to build a competitive triage classifier using features derived only from the textual content of the post. Further research needs to be done in order to translate our quantitative and qualitative findings into features, as it may improve overall performance.