论文标题
通过聚类在人类活动识别中进行对比学习来进行负面选择
Negative Selection by Clustering for Contrastive Learning in Human Activity Recognition
论文作者
论文摘要
对比度学习已根据传感器数据应用于人类活动识别(HAR),这是因为其实现具有大量未标记数据和少量标记数据的性能与监督学习相当的能力。对比学习的预训练任务通常是实例歧视,它指定了每个实例属于一个类,但这会将同一类样本视为负示例。这样的预训练任务不利于人类活动识别任务,这主要是分类任务。为了解决这个问题,我们遵循SIMCLR提出了一个新的对比学习框架,该框架通过在HAR中进行聚类来进行负面选择,这称为ClusterClhar。与SIMCLR相比,它通过使用无监督的聚类方法来生成软标签,以掩盖同一群集的其他样品,以避免将它们作为负样本避免,从而重新定义了对比损失函数中的负对。我们使用平均F1得分作为评估度量标准,在三个基准数据集(USC-HAD,MOTIONSESS和UCI-HAR)上评估ClusterClhar。实验结果表明,它的表现优于在自学学习和半监督学习中应用于HAR的所有最新方法。
Contrastive learning has been applied to Human Activity Recognition (HAR) based on sensor data owing to its ability to achieve performance comparable to supervised learning with a large amount of unlabeled data and a small amount of labeled data. The pre-training task for contrastive learning is generally instance discrimination, which specifies that each instance belongs to a single class, but this will consider the same class of samples as negative examples. Such a pre-training task is not conducive to human activity recognition tasks, which are mainly classification tasks. To address this problem, we follow SimCLR to propose a new contrastive learning framework that negative selection by clustering in HAR, which is called ClusterCLHAR. Compared with SimCLR, it redefines the negative pairs in the contrastive loss function by using unsupervised clustering methods to generate soft labels that mask other samples of the same cluster to avoid regarding them as negative samples. We evaluate ClusterCLHAR on three benchmark datasets, USC-HAD, MotionSense, and UCI-HAR, using mean F1-score as the evaluation metric. The experiment results show that it outperforms all the state-of-the-art methods applied to HAR in self-supervised learning and semi-supervised learning.