论文标题
在COVID-19爆发中对活动案例的预测中的样本量小诅咒
Curse of Small Sample Size in Forecasting of the Active Cases in COVID-19 Outbreak
论文作者
论文摘要
在Covid-19大流行期间,已经进行了大量尝试预测病例数量和该大流行的未来趋势的尝试。但是,他们无法以可靠的方式预测Covid-19爆发的基本特征的中等和长期演变,这是可接受的精度。本文在这个特定的预测问题中为机器学习模型的失败提供了解释。该论文表明,简单的线性回归模型可靠地提供了高预测准确性值,但仅在2周时期提供,并且相对复杂的机器学习模型具有以低误差学习长期预测的潜力,无法实现具有高概括能力的良好预测。论文中建议缺乏足够数量的样品是预测模型的预测性能低。根据交叉验证预测误差来衡量有关活动案例的预测结果的可靠性,这些误差被用作预报员的概括错误的期望。为了利用与活动案例最相关的信息,我们在各种变量上执行特征选择。我们应用不同的特征选择方法,即成对相关性,递归特征选择和特征选择,并使用套索回归并将它们相互比较,并与不采用任何特征选择的模型进行比较。此外,我们比较线性回归,多层感知器和长期术语内存模型,每个模型都用于预测活动案例以及上述特征选择方法。我们的结果表明,仅由于COVID-19数据的样本大小较小,对具有高概括能力的活性病例的准确预测才能达到3天。
During the COVID-19 pandemic, a massive number of attempts on the predictions of the number of cases and the other future trends of this pandemic have been made. However, they fail to predict, in a reliable way, the medium and long term evolution of fundamental features of COVID-19 outbreak within acceptable accuracy. This paper gives an explanation for the failure of machine learning models in this particular forecasting problem. The paper shows that simple linear regression models provide high prediction accuracy values reliably but only for a 2-weeks period and that relatively complex machine learning models, which have the potential of learning long term predictions with low errors, cannot achieve to obtain good predictions with possessing a high generalization ability. It is suggested in the paper that the lack of a sufficient number of samples is the source of low prediction performance of the forecasting models. The reliability of the forecasting results about the active cases is measured in terms of the cross-validation prediction errors, which are used as expectations for the generalization errors of the forecasters. To exploit the information, which is of most relevant with the active cases, we perform feature selection over a variety of variables. We apply different feature selection methods, namely the Pairwise Correlation, Recursive Feature Selection, and feature selection by using the Lasso regression and compare them to each other and also with the models not employing any feature selection. Furthermore, we compare Linear Regression, Multi-Layer Perceptron, and Long-Short Term Memory models each of which is used for prediction active cases together with the mentioned feature selection methods. Our results show that the accurate forecasting of the active cases with high generalization ability is possible up to 3 days only because of the small sample size of COVID-19 data.