论文标题
通过辐射副本进行自我监督的学习
Self-Supervised Learning Through Efference Copies
论文作者
论文摘要
自我监督学习(SSL)方法旨在利用机器学习(ML)的大量未标记数据(ML),但是基本原理通常是特定于方法的。从体现学习的生物学第一原理中得出的SSL框架可以统一各种SSL方法,有助于阐明大脑中的学习,并可能改善ML。 SSL通常将每个训练数据点转换为一对视图,将这种配对的知识用作一个积极的(即非对抗性的)自我提出的符号,并有可能反对它与无关的(即对比)负面示例。在这里,我们表明,这种类型的自我选择是神经科学(EFERENCE副本(EC))概念的不完整实现。具体而言,大脑还通过效率(即电动机命令)转换环境,但是它向其发送了一个完整命令的EC,即不仅仅是SSL符号。此外,其行动表示可能以自我为中心。从这样的原则基础中,我们正式恢复并扩展了SSL方法,例如SIMCLR,BYOL和RESIC,在共同的理论框架下,即通过辐射副本(S-TEC)进行自学。从经验上讲,S-TEC重组有意义地进行了阶级和之间的表示。这表现为在图像分类,分割,对象检测和音频中最近强的SSL基线的改善。这些结果假设大脑的电动机输出对其感觉表示产生了可检验的积极影响。
Self-supervised learning (SSL) methods aim to exploit the abundance of unlabelled data for machine learning (ML), however the underlying principles are often method-specific. An SSL framework derived from biological first principles of embodied learning could unify the various SSL methods, help elucidate learning in the brain, and possibly improve ML. SSL commonly transforms each training datapoint into a pair of views, uses the knowledge of this pairing as a positive (i.e. non-contrastive) self-supervisory sign, and potentially opposes it to unrelated, (i.e. contrastive) negative examples. Here, we show that this type of self-supervision is an incomplete implementation of a concept from neuroscience, the Efference Copy (EC). Specifically, the brain also transforms the environment through efference, i.e. motor commands, however it sends to itself an EC of the full commands, i.e. more than a mere SSL sign. In addition, its action representations are likely egocentric. From such a principled foundation we formally recover and extend SSL methods such as SimCLR, BYOL, and ReLIC under a common theoretical framework, i.e. Self-supervision Through Efference Copies (S-TEC). Empirically, S-TEC restructures meaningfully the within- and between-class representations. This manifests as improvement in recent strong SSL baselines in image classification, segmentation, object detection, and in audio. These results hypothesize a testable positive influence from the brain's motor outputs onto its sensory representations.