metamask：重新审视自我监督学习的尺寸混杂因素

论文标题

metamask：重新审视自我监督学习的尺寸混杂因素

MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning

论文作者

Li, Jiangmeng, Qiang, Wenwen, Zhang, Yanan, Mo, Wenyi, Zheng, Changwen, Su, Bing, Xiong, Hui

论文摘要

作为一种成功的自学学习方法，对比学习旨在学习在输入样本的扭曲中共享不变的信息。尽管对比度学习在抽样策略和架构设计方面取得了持续的进步，但仍然存在两个持续的缺陷：任务 - 近距离信息的干扰和样本效率低下，这与琐碎的恒定解决方案的反复存在有关。从维度分析的角度来看，我们发现尺寸的冗余和尺寸混杂因素是现象背后的内在问题，并提供了实验证据来支持我们的观点。我们进一步提出了一种简单而有效的方法metamask，这是元学习学到的维度面膜的缩写，以学习针对维度冗余和混杂因素的表示形式。 MetAmask采用冗余技术来解决维数冗余问题，并创新地引入了尺寸面具，以减少包含混杂因子的特定维度的梯度效应，该梯度通过使用元学习范式进行培训，以采用典型的自我自我掩护的自我掩盖表现的目标。与典型的对比方法相比，我们提供了坚实的理论分析以证明元掩体可以获得下游分类的更严格的风险范围。从经验上讲，我们的方法在各种基准上实现了最先进的性能。

As a successful approach to self-supervised learning, contrastive learning aims to learn invariant information shared among distortions of the input sample. While contrastive learning has yielded continuous advancements in sampling strategy and architecture design, it still remains two persistent defects: the interference of task-irrelevant information and sample inefficiency, which are related to the recurring existence of trivial constant solutions. From the perspective of dimensional analysis, we find out that the dimensional redundancy and dimensional confounder are the intrinsic issues behind the phenomena, and provide experimental evidence to support our viewpoint. We further propose a simple yet effective approach MetaMask, short for the dimensional Mask learned by Meta-learning, to learn representations against dimensional redundancy and confounder. MetaMask adopts the redundancy-reduction technique to tackle the dimensional redundancy issue and innovatively introduces a dimensional mask to reduce the gradient effects of specific dimensions containing the confounder, which is trained by employing a meta-learning paradigm with the objective of improving the performance of masked representations on a typical self-supervised task. We provide solid theoretical analyses to prove MetaMask can obtain tighter risk bounds for downstream classification compared to typical contrastive methods. Empirically, our method achieves state-of-the-art performance on various benchmarks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题