论文标题

知识学习的师生建筑:一项调查

Teacher-Student Architecture for Knowledge Learning: A Survey

论文作者

Hu, Chengming, Li, Xuan, Liu, Dan, Chen, Xi, Wang, Ju, Liu, Xue

论文摘要

尽管深度神经网络(DNN)表现出很强的能力解决了许多领域的大规模问题,但是很难将具有大量参数的DNN部署到实时系统中。为了解决这个问题,首先在知识蒸馏中使用了教师建筑,简单的学生网络可以实现与深层教师网络相当的表现。最近,教师学生建筑已被有效而广泛地接受了各种知识学习目标,包括知识蒸馏,知识扩展,知识适应和多任务学习。借助教师学生的体系结构,当前的研究能够通过轻巧有效的学生网络实现多个知识学习目标。与现有的知识蒸馏调查不同,这项调查详细讨论了具有多个知识学习目标的教师学生体系结构。此外,我们系统地介绍了知识学习期间的知识构建和优化过程,然后分析已利用的各种教师建筑和有效的学习方案来学习代表和强大的知识。本文还根据不同的目的(即分类,识别和一代)总结了教师架构的最新应用。最后,研究了知识学习的潜在研究方向,分别研究了教师学生的建筑设计,知识质量和基于回归学习的理论研究。通过这项综合调查,行业从业人员和学术界都可以学习有关多个知识学习目标的教师建筑的有见地的指南。

Although Deep Neural Networks (DNNs) have shown a strong capacity to solve large-scale problems in many areas, such DNNs with voluminous parameters are hard to be deployed in a real-time system. To tackle this issue, Teacher-Student architectures were first utilized in knowledge distillation, where simple student networks can achieve comparable performance to deep teacher networks. Recently, Teacher-Student architectures have been effectively and widely embraced on various knowledge learning objectives, including knowledge distillation, knowledge expansion, knowledge adaption, and multi-task learning. With the help of Teacher-Student architectures, current studies are able to achieve multiple knowledge-learning objectives through lightweight and effective student networks. Different from the existing knowledge distillation surveys, this survey detailedly discusses Teacher-Student architectures with multiple knowledge learning objectives. In addition, we systematically introduce the knowledge construction and optimization process during the knowledge learning and then analyze various Teacher-Student architectures and effective learning schemes that have been leveraged to learn representative and robust knowledge. This paper also summarizes the latest applications of Teacher-Student architectures based on different purposes (i.e., classification, recognition, and generation). Finally, the potential research directions of knowledge learning are investigated on the Teacher-Student architecture design, the quality of knowledge, and the theoretical studies of regression-based learning, respectively. With this comprehensive survey, both industry practitioners and the academic community can learn insightful guidelines about Teacher-Student architectures on multiple knowledge learning objectives.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源