论文标题
通过动态健身评估改善汽车系统的概括
Improving generalisation of AutoML systems with dynamic fitness evaluations
论文作者
论文摘要
机器学习开发人员面临的常见问题是过于拟合的,也就是说,将管道与训练数据非常紧密地拟合,而训练数据却降低了看不见的数据。自动化的机器学习旨在使开发人员摆脱管道创建负担(或至少放松),但这种过度拟合的问题可能会持续存在。实际上,当我们迭代地优化内部交叉验证的性能(最常见的\ textit {k} -fold)时,这可能会成为一个问题。尽管这种内部交叉验证希望减少这种过度拟合,但我们表明我们仍然可能会过度适合使用的特定折叠。在这项工作中,我们旨在通过引入动态健身评估来解决这个问题,该评估近似于重复\ textit {k} - 折叠式验证,而单\ textit {k} {k} fold {k} fold的成本几乎没有比典型的重复\ textit \ textit \ textit {k} -fold低得多。结果表明,当时间等同时,提出的适应性函数会比使用内部单\ textit {k} -fold的当前最新基线方法可显着改善。此外,在现有的进化计算方法之上实现了提出的扩展名,并且可以从本质上提供概括/测试性能的自由提升。
A common problem machine learning developers are faced with is overfitting, that is, fitting a pipeline too closely to the training data that the performance degrades for unseen data. Automated machine learning aims to free (or at least ease) the developer from the burden of pipeline creation, but this overfitting problem can persist. In fact, this can become more of a problem as we look to iteratively optimise the performance of an internal cross-validation (most often \textit{k}-fold). While this internal cross-validation hopes to reduce this overfitting, we show we can still risk overfitting to the particular folds used. In this work, we aim to remedy this problem by introducing dynamic fitness evaluations which approximate repeated \textit{k}-fold cross-validation, at little extra cost over single \textit{k}-fold, and far lower cost than typical repeated \textit{k}-fold. The results show that when time equated, the proposed fitness function results in significant improvement over the current state-of-the-art baseline method which uses an internal single \textit{k}-fold. Furthermore, the proposed extension is very simple to implement on top of existing evolutionary computation methods, and can provide essentially a free boost in generalisation/testing performance.