利用可解释的AI进行量化和修剪深度神经网络

论文标题

利用可解释的AI进行量化和修剪深度神经网络

Utilizing Explainable AI for Quantization and Pruning of Deep Neural Networks

论文作者

Sabih, Muhammad, Hannig, Frank, Teich, Juergen

论文摘要

对于许多应用程序，利用DNN（深度神经网络）需要以优化的方式对目标体系结构进行实施，以涉及能源消耗，内存需求，吞吐量等。DNN压缩用于降低DNN在硬件上部署前DNN的内存足迹和复杂性。最新理解和解释AI（人工智能）方法的努力导致了一个新的研究领域，称为AI可解释的AI。可解释的AI方法使我们能够更好地了解DNN的内部工作，例如不同神经元和特征的重要性。可解释的AI的概念为改善DNN压缩方法（例如量化和修剪）提供了一个机会，以几种方式尚未得到充分探索。在本文中，我们利用可解释的AI方法：主要移动方法。我们将这些方法用于（1）DNN的修剪；这包括\ ac {cnn}过滤器的结构化和非结构化修剪以及使用聚类算法对DNN权重的不均匀量化的修剪权重，以及完全连接的层的修剪权重；这也称为重量共享，（3）基于整数的混合精液量化；这是DNN的每一层都可以使用不同数量的整数位。我们将典型的图像分类数据集与常见的深度学习图像分类模型一起进行评估。在所有这三种情况下，我们都会通过在DNN压缩中使用可解释的AI来表现出重大改进以及新的见解和机会。

For many applications, utilizing DNNs (Deep Neural Networks) requires their implementation on a target architecture in an optimized manner concerning energy consumption, memory requirement, throughput, etc. DNN compression is used to reduce the memory footprint and complexity of a DNN before its deployment on hardware. Recent efforts to understand and explain AI (Artificial Intelligence) methods have led to a new research area, termed as explainable AI. Explainable AI methods allow us to understand better the inner working of DNNs, such as the importance of different neurons and features. The concepts from explainable AI provide an opportunity to improve DNN compression methods such as quantization and pruning in several ways that have not been sufficiently explored so far. In this paper, we utilize explainable AI methods: mainly DeepLIFT method. We use these methods for (1) pruning of DNNs; this includes structured and unstructured pruning of \ac{CNN} filters pruning as well as pruning weights of fully connected layers, (2) non-uniform quantization of DNN weights using clustering algorithm; this is also referred to as Weight Sharing, and (3) integer-based mixed-precision quantization; this is where each layer of a DNN may use a different number of integer bits. We use typical image classification datasets with common deep learning image classification models for evaluation. In all these three cases, we demonstrate significant improvements as well as new insights and opportunities from the use of explainable AI in DNN compression.

下载PDF全文

下载文献需遵守相关版权规定

论文标题