论文标题
基于语言信息的文本复杂性分类:应用于ESL的智能辅导
Text Complexity Classification Based on Linguistic Information: Application to Intelligent Tutoring of ESL
论文作者
论文摘要
这项工作的目的是建立一个分类器,该分类器可以在教导阅读英语作为第二语言(ESL)学习者的背景下识别文本复杂性。为了向语言学习者提供适合其英语水平的文本,可以确定一套可以描述给定文本的语音,形态,词汇,句法,句法,话语和心理复杂性的特征。 ESL专家使用6171个文本的语料库已经将其分为三个不同的难度,使用了五种机器学习算法进行了不同的实验。结果表明,采用的语言特征提供了良好的总体分类表现(F-评分= 0.97)。进行了可伸缩性评估,以测试是否可以在真实应用程序中使用此类分类器,例如,它可以将其插入搜索引擎或网络剪裁模块中。在此评估中,测试集中的文本不仅不同于培训集的文本,而且不同于不同类型的文本(ESL文本与儿童阅读文本)。尽管分类器的总体性能显着降低(F-SCORE = 0.65),但混淆矩阵表明,大多数分类误差均在两类和三类(中级类)之间,并且该系统在对一级和四类的文本分类方面具有良好的性能。这种行为可以通过两个语料库之间的分类标准差异来解释。因此,观察到的结果证实了实际应用程序中这种分类器的可用性。
The goal of this work is to build a classifier that can identify text complexity within the context of teaching reading to English as a Second Language (ESL) learners. To present language learners with texts that are suitable to their level of English, a set of features that can describe the phonological, morphological, lexical, syntactic, discursive, and psychological complexity of a given text were identified. Using a corpus of 6171 texts, which had already been classified into three different levels of difficulty by ESL experts, different experiments were conducted with five machine learning algorithms. The results showed that the adopted linguistic features provide a good overall classification performance (F-Score = 0.97). A scalability evaluation was conducted to test if such a classifier could be used within real applications, where it can be, for example, plugged into a search engine or a web-scraping module. In this evaluation, the texts in the test set are not only different from those from the training set but also of different types (ESL texts vs. children reading texts). Although the overall performance of the classifier decreased significantly (F-Score = 0.65), the confusion matrix shows that most of the classification errors are between the classes two and three (the middle-level classes) and that the system has a robust performance in categorizing texts of class one and four. This behavior can be explained by the difference in classification criteria between the two corpora. Hence, the observed results confirm the usability of such a classifier within a real-world application.