论文标题
从嘈杂的失败中学习逻辑程序
Learning Logic Programs From Noisy Failures
论文作者
论文摘要
归纳逻辑编程(ILP)是机器学习(ML)的一种形式,与许多其他最先进的ML方法相反,通常会产生高度易于解释和可重复使用的模型。但是,许多ILP系统缺乏从任何嘈杂或部分错误分类的培训数据中自然学习的能力。我们介绍了从失败方法到ILP的轻松学习,这是对先前引入的失败方法(LFF)方法的噪声处理修改,该方法无法处理噪声。我们还介绍了新型嘈杂的Popper ILP系统,该系统实现了这种轻松的方法,并且是现有Popper系统的修改。像Popper一样,嘈杂的Popper采用生成测试循环来搜索其假设空间,其中使用了失败的假设来构建假设约束。这些约束用于修剪假设空间,从而使假设搜索更有效。但是,在轻松的环境中,以更宽松的方式产生约束,以避免允许嘈杂的训练数据导致假设限制,从而修剪最佳假设。松弛设置独有的约束是通过假设比较产生的。通过权衡假设的准确性与其大小的大小来产生其他约束,以避免使用最小描述长度过度拟合。我们通过理论证明以及实验结果来支持这种新环境,这表明嘈杂的Popper提高了Popper的噪声处理能力,但以整体运行时效率为代价。
Inductive Logic Programming (ILP) is a form of machine learning (ML) which in contrast to many other state of the art ML methods typically produces highly interpretable and reusable models. However, many ILP systems lack the ability to naturally learn from any noisy or partially misclassified training data. We introduce the relaxed learning from failures approach to ILP, a noise handling modification of the previously introduced learning from failures (LFF) approach which is incapable of handling noise. We additionally introduce the novel Noisy Popper ILP system which implements this relaxed approach and is a modification of the existing Popper system. Like Popper, Noisy Popper takes a generate-test-constrain loop to search its hypothesis space wherein failed hypotheses are used to construct hypothesis constraints. These constraints are used to prune the hypothesis space, making the hypothesis search more efficient. However, in the relaxed setting, constraints are generated in a more lax fashion as to avoid allowing noisy training data to lead to hypothesis constraints which prune optimal hypotheses. Constraints unique to the relaxed setting are generated via hypothesis comparison. Additional constraints are generated by weighing the accuracy of hypotheses against their sizes to avoid overfitting through an application of the minimum description length. We support this new setting through theoretical proofs as well as experimental results which suggest that Noisy Popper improves the noise handling capabilities of Popper but at the cost of overall runtime efficiency.