在面向应用程序的环境中使用深度学习方法的持续学习

论文标题

在面向应用程序的环境中使用深度学习方法的持续学习

Continual Learning with Deep Learning Methods in an Application-Oriented Context

论文作者

Pfülb, Benedikt

论文摘要

抽象知识在许多基于计算机的应用程序中都基于深度。人工智能的重要研究领域（AI）介绍了从数据中自动推导知识。机器学习提供了算法。研究领域的重点是生物学启发的学习算法的发展。各自的机器学习方法基于神经概念，因此它们可以从数据中系统地得出知识并存储它。可以将一种可以归类为“深度学习”模型的机器学习算法称为深神经网络（DNNS）。 DNN由多个通过使用反向传播算法训练的层的人工神经元组成。这些深度学习方法具有从高维数据中推断和存储复杂知识的惊人功能。但是，DNN受到阻止新知识被添加到现有基础的问题的影响。持续积累知识的能力是导致进化的重要因素，因此是强大AIS发展的先决条件。所谓的“灾难性遗忘”（CF）效应使DNN在对新数据分布进行了几次训练迭代后立即失去了已经衍生的知识。只有通过过去和新数据的联合数据分布进行的精力昂贵的重新培训才能使整个新知识集的抽象抽象。为了抵消效果，已经并且仍在开发各种技术，以减轻甚至解决CF问题的目标。这些发表的CF回避研究通常意味着其方法对各种持续学习任务的有效性。本文是在通过深度学习方法的连续机器学习的背景下设定的。第一部分涉及一个...的发展

Abstract knowledge is deeply grounded in many computer-based applications. An important research area of Artificial Intelligence (AI) deals with the automatic derivation of knowledge from data. Machine learning offers the according algorithms. One area of research focuses on the development of biologically inspired learning algorithms. The respective machine learning methods are based on neurological concepts so that they can systematically derive knowledge from data and store it. One type of machine learning algorithms that can be categorized as "deep learning" model is referred to as Deep Neural Networks (DNNs). DNNs consist of multiple artificial neurons arranged in layers that are trained by using the backpropagation algorithm. These deep learning methods exhibit amazing capabilities for inferring and storing complex knowledge from high-dimensional data. However, DNNs are affected by a problem that prevents new knowledge from being added to an existing base. The ability to continuously accumulate knowledge is an important factor that contributed to evolution and is therefore a prerequisite for the development of strong AIs. The so-called "catastrophic forgetting" (CF) effect causes DNNs to immediately loose already derived knowledge after a few training iterations on a new data distribution. Only an energetically expensive retraining with the joint data distribution of past and new data enables the abstraction of the entire new set of knowledge. In order to counteract the effect, various techniques have been and are still being developed with the goal to mitigate or even solve the CF problem. These published CF avoidance studies usually imply the effectiveness of their approaches for various continual learning tasks. This dissertation is set in the context of continual machine learning with deep learning methods. The first part deals with the development of an ...

下载PDF全文

下载文献需遵守相关版权规定

论文标题