论文标题
学习在域移动下群集
Learning to Cluster under Domain Shift
论文作者
论文摘要
尽管基于深层体系结构的无监督域适应方法在许多计算机视觉任务中取得了显着的成功,但它们依赖于强有力的假设,即必须可用标记的源数据。在这项工作中,我们克服了这一假设,并解决了当源和目标数据没有注释时,将知识从源转移到目标域的问题。受到最新研究的启发,我们的方法利用了从多个源域收集的数据中的信息来构建一个域 - 不合Snostic聚类模型,然后在目标数据可用时在推理时间进行完善。具体而言,在训练时,我们建议优化一种新型的信息理论损失,该损失与域平衡层相结合,可确保我们的模型学会在丢弃特定领域的特定特征的同时正确地发现语义标签。重要的是,我们的体系结构设计可确保在推理时,由于功能对齐和自upervision,在推理时可以有效地适应目标域而无需访问源数据。我们考虑了几个域适应基准,我们在各种环境中评估了所提出的方法,我们表明我们的方法即使在很少有目标样本的情况下,我们的方法也能够自动发现相关的语义信息,并且在多个域适应基准上产生了最先进的结果。
While unsupervised domain adaptation methods based on deep architectures have achieved remarkable success in many computer vision tasks, they rely on a strong assumption, i.e. labeled source data must be available. In this work we overcome this assumption and we address the problem of transferring knowledge from a source to a target domain when both source and target data have no annotations. Inspired by recent works on deep clustering, our approach leverages information from data gathered from multiple source domains to build a domain-agnostic clustering model which is then refined at inference time when target data become available. Specifically, at training time we propose to optimize a novel information-theoretic loss which, coupled with domain-alignment layers, ensures that our model learns to correctly discover semantic labels while discarding domain-specific features. Importantly, our architecture design ensures that at inference time the resulting source model can be effectively adapted to the target domain without having access to source data, thanks to feature alignment and self-supervision. We evaluate the proposed approach in a variety of settings, considering several domain adaptation benchmarks and we show that our method is able to automatically discover relevant semantic information even in presence of few target samples and yields state-of-the-art results on multiple domain adaptation benchmarks.