论文标题

EVINDICEILMIX:使用开放式和封闭式嘈杂标签的组合学习

EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

论文作者

Sachdeva, Ragav, Cordeiro, Filipe R., Belagiannis, Vasileios, Reid, Ian, Carneiro, Gustavo

论文摘要

深度学习的功效取决于大规模的数据集,这些数据集已通过可靠的数据采集和注释过程精心策划。但是,以精确的注释获取如此大规模的数据集非常昂贵且耗时,而且廉价的替代方案通常会产生具有嘈杂标签的数据集。该领域通过关注两种类型的标签噪声下的训练模型来解决这个问题:1)封闭式噪声,其中一些训练样本被错误地注释给除了其已知的真实类别以外的培训标签; 2)开放式噪声,其中训练集包括(严格地)拥有的真实类别的样品,该类别不包含在已知的训练标签中。在这项工作中,我们研究了嘈杂标签问题的新变体,该标签问题结合了开放式噪声标签,并引入基准评估,以评估本设置下的培训算法的性能。我们认为,这样的问题更加笼统,并且更好地反映了实践中嘈杂的标签场景。此外,我们提出了一种称为EvidentialMix的新颖算法,该算法解决了此问题,并将其性能与提议基准上的封闭设置和开放式噪声的最新方法进行了比较。我们的结果表明,与以前的最新方法相比,我们的方法产生了卓越的分类结果和更好的特征表示。该代码可在https://github.com/ragavsachdeva/evidentialmix上找到。

The efficacy of deep learning depends on large-scale data sets that have been carefully curated with reliable data acquisition and annotation processes. However, acquiring such large-scale data sets with precise annotations is very expensive and time-consuming, and the cheap alternatives often yield data sets that have noisy labels. The field has addressed this problem by focusing on training models under two types of label noise: 1) closed-set noise, where some training samples are incorrectly annotated to a training label other than their known true class; and 2) open-set noise, where the training set includes samples that possess a true class that is (strictly) not contained in the set of known training labels. In this work, we study a new variant of the noisy label problem that combines the open-set and closed-set noisy labels, and introduce a benchmark evaluation to assess the performance of training algorithms under this setup. We argue that such problem is more general and better reflects the noisy label scenarios in practice. Furthermore, we propose a novel algorithm, called EvidentialMix, that addresses this problem and compare its performance with the state-of-the-art methods for both closed-set and open-set noise on the proposed benchmark. Our results show that our method produces superior classification results and better feature representations than previous state-of-the-art methods. The code is available at https://github.com/ragavsachdeva/EvidentialMix.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源