论文标题

弥合受监督和无监督学习之间的差距,以进行细粒度分类

Bridge the Gap between Supervised and Unsupervised Learning for Fine-Grained Classification

论文作者

Wang, Jiabao, Li, Yang, Wei, Xiu-Shen, Li, Hang, Miao, Zhuang, Zhang, Rui

论文摘要

无监督的学习技术已经赶上了甚至超过一般对象分类(GOC)和人员重新识别(RE-ID)的监督学习技术。但是,发现无监督的细粒度视觉分类(FGVC)比GOC和人重新ID更具挑战性。为了弥合FGVC的无监督和监督学习之间的差距,我们研究了监督和无监督的FGVC之间的性能差距(包括特征提取,聚类和对比度学习)。此外,我们提出了一种简单,有效和实用的方法,称为UFCL,以减轻差距。涉及三个关键问题和改进:首先,我们引入了强大而强大的骨干RESNET50-IBN,当我们将Imagenet预训练的模型传输到FGVC任务时,它具有域适应性的能力。接下来,我们建议引入HDBSCAN,而不是DBSCAN进行聚类,这可以为相邻类别生成更好的类别群,而超参数较少。最后,我们提出了一种加权特征代理及其更新机制,通过使用不可避免的噪声的伪标签来进行对比学习,这可以改善学习网络参数的优化过程。我们的UFCL的有效性在CUB-200-2011,牛津流,牛津宠物,斯坦福 - 狗,斯坦福 - 卡车和FGVC-Aircraft数据集中进行了验证。在无监督的FGVC设置下,我们实现了最先进的结果,并分析了提供实际指导的关键因素和重要参数。

Unsupervised learning technology has caught up with or even surpassed supervised learning technology in general object classification (GOC) and person re-identification (re-ID). However, it is found that the unsupervised learning of fine-grained visual classification (FGVC) is more challenging than GOC and person re-ID. In order to bridge the gap between unsupervised and supervised learning for FGVC, we investigate the essential factors (including feature extraction, clustering, and contrastive learning) for the performance gap between supervised and unsupervised FGVC. Furthermore, we propose a simple, effective, and practical method, termed as UFCL, to alleviate the gap. Three key issues are concerned and improved: First, we introduce a robust and powerful backbone, ResNet50-IBN, which has an ability of domain adaptation when we transfer ImageNet pre-trained models to FGVC tasks. Next, we propose to introduce HDBSCAN instead of DBSCAN to do clustering, which can generate better clusters for adjacent categories with fewer hyper-parameters. Finally, we propose a weighted feature agent and its updating mechanism to do contrastive learning by using the pseudo labels with inevitable noise, which can improve the optimization process of learning the parameters of the network. The effectiveness of our UFCL is verified on CUB-200-2011, Oxford-Flowers, Oxford-Pets, Stanford-Dogs, Stanford-Cars and FGVC-Aircraft datasets. Under the unsupervised FGVC setting, we achieve state-of-the-art results, and analyze the key factors and the important parameters to provide a practical guidance.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源