论文标题

通过开发项目学习,用于皮肤病学诊断的联合对比度学习

Federated Contrastive Learning for Dermatological Disease Diagnosis via On-device Learning

论文作者

Wu, Yawen, Zeng, Dewen, Wang, Zhepeng, Sheng, Yi, Yang, Lei, James, Alaina J., Shi, Yiyu, Hu, Jingtong

论文摘要

深度学习模型已被部署在越来越多的优势和移动设备中,以提供医疗保健。这些模型依靠大量标记数据来培训以实现高精度。但是,对于诸如皮肤病学诊断的医学应用,移动皮肤病学助理收集的私人数据存在于患者的分布式移动设备上,并且每种设备的数据仅具有有限的数据。直接从有限的数据中学习会大大恶化学习模型的性能。联合学习(FL)可以通过使用在设备上分发的数据来培训模型,同时保留数据本地隐私。 FL上的现有作品假设所有数据都具有地面真相标签。但是,医疗数据通常没有任何随附的标签,因为标签需要专业知识,并导致高昂的人工成本。最近开发的自我监督学习方法,对比学习(CL)可以利用未标记的数据预先培训A模型,然后该模型在有限的标记数据上进行了微调数据以进行皮肤病学疾病诊断。但是,仅将CL与FL作为联合对比度学习(FCL)相结合,将导致学习效率低下,因为CL需要多种学习数据,但每个设备只有有限的数据。在这项工作中,我们提出了一个用于皮肤病学疾病诊断有限标签的设备FCL框架。功能在FCL预训练过程中共享,以提供多种和准确的对比信息。之后,预先训练的模型通过在每个设备上独立地与本地标记的数据进行微调或与所有设备上有监督的联合学习协作。关于皮肤病学数据集的实验表明,与最先进的方法相比,提出的框架有效地改善了皮肤病学诊断的回忆和精度。

Deep learning models have been deployed in an increasing number of edge and mobile devices to provide healthcare. These models rely on training with a tremendous amount of labeled data to achieve high accuracy. However, for medical applications such as dermatological disease diagnosis, the private data collected by mobile dermatology assistants exist on distributed mobile devices of patients, and each device only has a limited amount of data. Directly learning from limited data greatly deteriorates the performance of learned models. Federated learning (FL) can train models by using data distributed on devices while keeping the data local for privacy. Existing works on FL assume all the data have ground-truth labels. However, medical data often comes without any accompanying labels since labeling requires expertise and results in prohibitively high labor costs. The recently developed self-supervised learning approach, contrastive learning (CL), can leverage the unlabeled data to pre-train a model, after which the model is fine-tuned on limited labeled data for dermatological disease diagnosis. However, simply combining CL with FL as federated contrastive learning (FCL) will result in ineffective learning since CL requires diverse data for learning but each device only has limited data. In this work, we propose an on-device FCL framework for dermatological disease diagnosis with limited labels. Features are shared in the FCL pre-training process to provide diverse and accurate contrastive information. After that, the pre-trained model is fine-tuned with local labeled data independently on each device or collaboratively with supervised federated learning on all devices. Experiments on dermatological disease datasets show that the proposed framework effectively improves the recall and precision of dermatological disease diagnosis compared with state-of-the-art methods.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源