论文标题
自动结核病和共同研究使用深度学习
Automatic Tuberculosis and COVID-19 cough classification using deep learning
论文作者
论文摘要
我们提出了一个基于深度学习的自动咳嗽分类器,可以区分结核病(TB)与19009咳嗽和健康咳嗽的咳嗽。 TB和COVID-19都是呼吸道疾病,具有传染性,具有咳嗽为主要症状,每年夺去了数千人的生命。在室内和室外设置都收集了咳嗽的录音,并使用来自全球受试者的智能手机上传,从而包含各种噪声。该咳嗽数据包括1.68小时的结核病咳嗽,18.54分钟的Covid-19-19和1.69小时的健康咳嗽,47例TB患者,229名Covid-19患者和1498名健康患者,并用于培训和评估CNN,LSTM和RESNET50。这三个深层体系结构在2.14小时的打喷嚏,2.91小时的语音和2.79小时的噪音中也进行了预训练,以提高性能。通过使用SMOTE数据平衡技术并使用诸如F1得分和AUC之类的性能指标来解决我们数据集中的类不平衡。我们的研究表明,分别从预训练的RESNET50中获得了最高的0.9259和0.8631的F1分数,分别是两级(TB与Covid-19)和三级(TB VS VS COVID-19与健康)的咳嗽分类。深度转移学习的应用改善了分类器的性能,并使它们更加坚固,因为它们在交叉验证折叠上更好地概括了。他们的性能超过了世界卫生组织(WHO)设定的结核病分类测试要求。产生最佳性能的功能包含MFCC的高阶,这表明人耳朵无法感知TB和Covid-19咳嗽之间的差异。这种类型的咳嗽音频分类是非接触,具有成本效益的,并且可以轻松地部署在智能手机上,因此它可以成为TB和COVID-19筛查的绝佳工具。
We present a deep learning based automatic cough classifier which can discriminate tuberculosis (TB) coughs from COVID-19 coughs and healthy coughs. Both TB and COVID-19 are respiratory diseases, contagious, have cough as a predominant symptom and claim thousands of lives each year. The cough audio recordings were collected at both indoor and outdoor settings and also uploaded using smartphones from subjects around the globe, thus containing various levels of noise. This cough data include 1.68 hours of TB coughs, 18.54 minutes of COVID-19 coughs and 1.69 hours of healthy coughs from 47 TB patients, 229 COVID-19 patients and 1498 healthy patients and were used to train and evaluate a CNN, LSTM and Resnet50. These three deep architectures were also pre-trained on 2.14 hours of sneeze, 2.91 hours of speech and 2.79 hours of noise for improved performance. The class-imbalance in our dataset was addressed by using SMOTE data balancing technique and using performance metrics such as F1-score and AUC. Our study shows that the highest F1-scores of 0.9259 and 0.8631 have been achieved from a pre-trained Resnet50 for two-class (TB vs COVID-19) and three-class (TB vs COVID-19 vs healthy) cough classification tasks, respectively. The application of deep transfer learning has improved the classifiers' performance and makes them more robust as they generalise better over the cross-validation folds. Their performances exceed the TB triage test requirements set by the world health organisation (WHO). The features producing the best performance contain higher order of MFCCs suggesting that the differences between TB and COVID-19 coughs are not perceivable by the human ear. This type of cough audio classification is non-contact, cost-effective and can easily be deployed on a smartphone, thus it can be an excellent tool for both TB and COVID-19 screening.