论文标题
使用多种机器学习和深度学习方法基于电子健康记录(EHR)数据的患者严重性状态分类
Patients' Severity States Classification based on Electronic Health Record (EHR) Data using Multiple Machine Learning and Deep Learning Approaches
论文作者
论文摘要
这项研究介绍了使用多种机器学习和深度学习方法在一定时间范围内根据其电子健康记录对患者的严重性状态进行分类的检查。建议的方法使用从开源平台收集的EHR数据集来分类严重性。在这项研究中使用了一些工具,例如使用OpenRefine进行预处理,RapidMiner用于实现三种算法(快速大幅度,广义线性模型,多层馈电神经网络),并使用Tableau可视化数据,用于实现算法,我们使用了Google Colab。在这里,我们实施了几种受监督和无监督的算法以及半监督和深度学习算法。实验结果表明,高参数随机森林的表现优于所有其他监督机器学习算法,其精度为76%,并且普遍的线性算法达到了最高的精度得分78%,而高参数调整的等级层次集群具有86%的精确分数和86%的精确分数和86%的混合物,并具有61%的精确效果。对于大多数无监督的技术,降低降低的结果很大。为了实施深度学习,我们采用了馈送前向神经网络(多层)和半监督学习的快速利润方法。快速较大的保证金表现非常出色,召回得分为84%,F1得分为78%。最后,多层馈电神经网络以75%的精度,75%的精度,87%的召回率,81%的F1得分表现出色。
This research presents an examination of categorizing the severity states of patients based on their electronic health records during a certain time range using multiple machine learning and deep learning approaches. The suggested method uses an EHR dataset collected from an open-source platform to categorize severity. Some tools were used in this research, such as openRefine was used to pre-process, RapidMiner was used for implementing three algorithms (Fast Large Margin, Generalized Linear Model, Multi-layer Feed-forward Neural Network) and Tableau was used to visualize the data, for implementation of algorithms we used Google Colab. Here we implemented several supervised and unsupervised algorithms along with semi-supervised and deep learning algorithms. The experimental results reveal that hyperparameter-tuned Random Forest outperformed all the other supervised machine learning algorithms with 76% accuracy as well as Generalized Linear algorithm achieved the highest precision score 78%, whereas the hyperparameter-tuned Hierarchical Clustering with 86% precision score and Gaussian Mixture Model with 61% accuracy outperformed other unsupervised approaches. Dimensionality Reduction improved results a lot for most unsupervised techniques. For implementing Deep Learning we employed a feed-forward neural network (multi-layer) and the Fast Large Margin approach for semi-supervised learning. The Fast Large Margin performed really well with a recall score of 84% and an F1 score of 78%. Finally, the Multi-layer Feed-forward Neural Network performed admirably with 75% accuracy, 75% precision, 87% recall, 81% F1 score.