论文标题
最近的邻居分类器,有利的罚款
Nearest Neighbor Classifier with Margin Penalty for Active Learning
论文作者
论文摘要
随着深度学习成为自然语言处理领域的主流,对合适的主动学习方法的需求变得前所未有。提出了基于最近的邻居分类器的主动学习(AL)方法,并证明了较高的结果。但是,现有的最近的邻居分类器不适合分类相互独家类,因为无法通过最近的邻居分类器来确保类间差异。结果,无法发现边缘区域中的信息样本,并且性能损坏。为此,我们提出了一个新颖的邻居分类器,并为主动学习的边缘罚款(NCMAL)。首先,在类别之间增加了强制性的边缘惩罚,因此可以确保阶层间的差异和阶层内紧凑性。其次,提出了一种新型的样本选择策略,以发现边缘区域内的信息样本。为了证明这些方法的有效性,我们使用其他最先进的方法对数据集进行了广泛的实验。实验结果表明,与所有基线方法相比,我们的方法具有更少的带注释样品的结果。
As deep learning becomes the mainstream in the field of natural language processing, the need for suitable active learning method are becoming unprecedented urgent. Active Learning (AL) methods based on nearest neighbor classifier are proposed and demonstrated superior results. However, existing nearest neighbor classifier are not suitable for classifying mutual exclusive classes because inter-class discrepancy cannot be assured by nearest neighbor classifiers. As a result, informative samples in the margin area can not be discovered and AL performance are damaged. To this end, we propose a novel Nearest neighbor Classifier with Margin penalty for Active Learning(NCMAL). Firstly, mandatory margin penalty are added between classes, therefore both inter-class discrepancy and intra-class compactness are both assured. Secondly, a novel sample selection strategy are proposed to discover informative samples within the margin area. To demonstrate the effectiveness of the methods, we conduct extensive experiments on for datasets with other state-of-the-art methods. The experimental results demonstrate that our method achieves better results with fewer annotated samples than all baseline methods.