论文标题
基因编程自然适合发展包装合奏
Genetic Programming is Naturally Suited to Evolve Bagging Ensembles
论文作者
论文摘要
通过包装进行学习集合可以大大改善低偏置,高变化估计器的概括性能,包括遗传编程(GP)演变的估计器。为了高效,用于发展(包装)合奏的现代GP算法通常依赖几种(通常是相互连接的)机制和各自的超参数,最终损害了易用性。在本文中,我们提供了可能不保证这种复杂性的实验证据。我们表明,对健身评估和选择的微小变化足以使简单的传统GP算法有效地进化合奏。我们建议的关键是利用行李的工作方式来计算人口中的每个人,以多种健身价值(而不是一个),其成本仅略高于正常健身评估之一。从先前研究中采取和复制的分类和回归任务的实验比较表明,我们的算法票价非常适合针对最先进的集合和非集合GP算法。我们通过(i)缩放整体尺寸,(ii)消除选择变化,(iii)观察传统子树变异引起的可变性。代码:https://github.com/marcovirgolin/2segp。
Learning ensembles by bagging can substantially improve the generalization performance of low-bias, high-variance estimators, including those evolved by Genetic Programming (GP). To be efficient, modern GP algorithms for evolving (bagging) ensembles typically rely on several (often inter-connected) mechanisms and respective hyper-parameters, ultimately compromising ease of use. In this paper, we provide experimental evidence that such complexity might not be warranted. We show that minor changes to fitness evaluation and selection are sufficient to make a simple and otherwise-traditional GP algorithm evolve ensembles efficiently. The key to our proposal is to exploit the way bagging works to compute, for each individual in the population, multiple fitness values (instead of one) at a cost that is only marginally higher than the one of a normal fitness evaluation. Experimental comparisons on classification and regression tasks taken and reproduced from prior studies show that our algorithm fares very well against state-of-the-art ensemble and non-ensemble GP algorithms. We further provide insights into the proposed approach by (i) scaling the ensemble size, (ii) ablating the changes to selection, (iii) observing the evolvability induced by traditional subtree variation. Code: https://github.com/marcovirgolin/2SEGP.