通过重量间隔约束的持续学习并保证

论文标题

通过重量间隔约束的持续学习并保证

Continual Learning with Guarantees via Weight Interval Constraints

论文作者

Wołczyk, Maciej, Piczak, Karol J., Wójcik, Bartosz, Pustelnik, Łukasz, Morawiecki, Paweł, Tabor, Jacek, Trzciński, Tomasz, Spurek, Przemysław

论文摘要

我们引入了一个新的培训范式，该范围对神经网络参数空间进行间隔约束以控制遗忘。当代持续学习（CL）方法从数据流有效地培训神经网络，同时减少灾难性遗忘的负面影响，但它们不能提供任何确保随着时间的流逝不会无法控制的网络绩效。在这项工作中，我们展示了如何通过将模型的持续学习作为其参数空间的持续收缩来遗忘。为此，我们提出了Hypertrectangle训练，这是一种新的训练方法，其中每个任务都由参数空间中的超矩形表示，完全包含在先前任务的超矩形中。这种配方将NP-HARD CL问题降低到多项式时间，同时提供了完全抗忘记的弹性。我们通过开发Intercontinet（间隔持续学习）算法来验证我们的主张，该算法利用间隔算术来有效地将参数区域建模为超矩形。通过实验结果，我们表明我们的方法在持续的学习设置中表现良好，而无需存储以前的任务中的数据。

We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate uncontrollably over time. In this work, we show how to put bounds on forgetting by reformulating continual learning of a model as a continual contraction of its parameter space. To that end, we propose Hyperrectangle Training, a new training methodology where each task is represented by a hyperrectangle in the parameter space, fully contained in the hyperrectangles of the previous tasks. This formulation reduces the NP-hard CL problem back to polynomial time while providing full resilience against forgetting. We validate our claim by developing InterContiNet (Interval Continual Learning) algorithm which leverages interval arithmetic to effectively model parameter regions as hyperrectangles. Through experimental results, we show that our approach performs well in a continual learning setup without storing data from previous tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题