Learn the Time to Learn: Replay Scheduling in Continual Learning

Paper Title

Learn the Time to Learn: Replay Scheduling in Continual Learning

Authors

Marcus Klasson, Hedvig Kjellström, Cheng Zhang

Abstract

Replay methods are known to be successful at mitigating catastrophic forgetting in continual learning scenarios despite having limited access to historical data. However, storing historical data is cheap in many real-world settings, yet replaying all historical data is often prohibited due to processing time constraints. In such settings, we propose that continual learning systems should learn the time to learn and schedule which tasks to replay at different time steps. We first demonstrate the benefits of our proposal by using Monte Carlo tree search to find a proper replay schedule, and show that the found replay schedules can outperform fixed scheduling policies when combined with various replay methods in different continual learning settings. Additionally, we propose a framework for learning replay scheduling policies with reinforcement learning. We show that the learned policies can generalize better in new continual learning scenarios compared to equally replaying all seen tasks, without added computational cost. Our study reveals the importance of learning the time to learn in continual learning, which brings current research closer to real-world needs.

Paper Title