论文标题
使用生成网络识别痴呆症的数据增强
Data augmentation using generative networks to identify dementia
论文作者
论文摘要
数据限制是用于医疗应用的培训机器学习分类器中最常见的问题之一。由于道德问题和数据隐私,可以招募到此类实验的人数通常小于为非医疗保健数据集做出贡献的参与者的数量。最近的研究表明,生成模型可以用作有效的数据增强方法,最终可以帮助训练更强大的分类器稀疏数据域。许多研究证明,这种数据增强技术适用于图像和音频数据集。在本文中,我们研究了类似方法在不同类型的语音和基于音频的特征中的应用,这些功能从与我们的自动痴呆症检测系统记录的相互作用中提取。使用两个生成模型,我们展示了生成的合成样品如何改善基于DNN的分类器的性能。各种自动编码器提高了四向分类器的F评分,该分类器将记忆诊所中典型患者群体区分开来,从58%到约74%,提高了16%
Data limitation is one of the most common issues in training machine learning classifiers for medical applications. Due to ethical concerns and data privacy, the number of people that can be recruited to such experiments is generally smaller than the number of participants contributing to non-healthcare datasets. Recent research showed that generative models can be used as an effective approach for data augmentation, which can ultimately help to train more robust classifiers sparse data domains. A number of studies proved that this data augmentation technique works for image and audio data sets. In this paper, we investigate the application of a similar approach to different types of speech and audio-based features extracted from interactions recorded with our automatic dementia detection system. Using two generative models we show how the generated synthesized samples can improve the performance of a DNN based classifier. The variational autoencoder increased the F-score of a four-way classifier distinguishing the typical patient groups seen in memory clinics from 58% to around 74%, a 16% improvement