论文标题
与自适应分类器不平衡的半监督学习
Imbalanced Semi-supervised Learning with Bias Adaptive Classifier
论文作者
论文摘要
伪标记已被证明是一种有希望的半监督学习(SSL)范式。现有的伪标记方法通常假定培训数据的类别分布是平衡的。但是,这样的假设远非现实的场景,因此严重限制了当前伪标记方法在阶级不平衡的背景下的性能。为了减轻这个问题,我们设计了一个针对不平衡SSL设置的偏置自适应分类器。核心思想是通过偏见自适应分类器自动吸收由阶级失衡引起的训练偏见,该分类器由新颖的偏见吸引子和原始线性分类器组成。偏见吸引子被设计为轻巧的残留网络,并通过双层学习框架进行了优化。这种学习策略使偏置自适应分类器能够符合不平衡的训练数据,而线性分类器可以为每个类提供无偏的标签预测。我们在各种不平衡的半监督设置下进行了广泛的实验,结果表明我们的方法可以应用于不同的伪标记模型,并且优于当前的最新方法。
Pseudo-labeling has proven to be a promising semi-supervised learning (SSL) paradigm. Existing pseudo-labeling methods commonly assume that the class distributions of training data are balanced. However, such an assumption is far from realistic scenarios and thus severely limits the performance of current pseudo-labeling methods under the context of class-imbalance. To alleviate this problem, we design a bias adaptive classifier that targets the imbalanced SSL setups. The core idea is to automatically assimilate the training bias caused by class imbalance via the bias adaptive classifier, which is composed of a novel bias attractor and the original linear classifier. The bias attractor is designed as a light-weight residual network and optimized through a bi-level learning framework. Such a learning strategy enables the bias adaptive classifier to fit imbalanced training data, while the linear classifier can provide unbiased label prediction for each class. We conduct extensive experiments under various imbalanced semi-supervised setups, and the results demonstrate that our method can be applied to different pseudo-labeling models and is superior to current state-of-the-art methods.