鲁棒性感知的2位量化具有神经网络的实时性能

论文标题

鲁棒性感知的2位量化具有神经网络的实时性能

Robustness-aware 2-bit quantization with real-time performance for neural network

论文作者

Li, Xiaobin, Jiang, Hongxu, Huang, Shuangxi, Tian, Fangzheng

论文摘要

量化位精度降低的量化神经网络（NN）是减少计算和内存资源需求并在机器学习中起着至关重要的作用的有效解决方案。但是，避免由于其数值近似和较低的冗余而避免明显的准确性降低仍然具有挑战性。在本文中，提出了一种新颖的鲁棒性2位量化方案，用于二进制NN和生成对抗网络（GAN）的NN基础，Witch通过丰富二进制NN的信息来提高性能，有效地提取结构信息并考虑量化NN的鲁棒性。具体而言，使用Shift添加操作来替换量化过程中的多重蓄能，可以有效地加快NN的速度。同时，提出了原始NN和量化NN之间的结构损失，以便在量化后保留数据的结构信息。从NN中学到的结构信息不仅在改善性能中起着重要作用，而且还可以通过将Lipschitz的约束应用于结构损失来进一步调整量化网络。此外，我们还首次考虑了量化NN的鲁棒性，并通过引入不敏感的频谱规范项提出了非敏感的扰动损失函数。这些实验是在CIFAR-10和Imagenet数据集上进行的，该数据集具有流行的NN（例如Moblienetv2，Squeezenet，Resnet20等）。实验结果表明，在2位精确的情况下，所提出的算法比最先进的量化方法更具竞争力。同时，实验结果还表明，在FGSM对抗样品攻击下所提出的方法是鲁棒的。

Quantized neural network (NN) with a reduced bit precision is an effective solution to reduces the computational and memory resource requirements and plays a vital role in machine learning. However, it is still challenging to avoid the significant accuracy degradation due to its numerical approximation and lower redundancy. In this paper, a novel robustness-aware 2-bit quantization scheme is proposed for NN base on binary NN and generative adversarial network(GAN), witch improves the performance by enriching the information of binary NN, efficiently extract the structural information and considering the robustness of the quantized NN. Specifically, using shift addition operation to replace the multiply-accumulate in the quantization process witch can effectively speed the NN. Meanwhile, a structural loss between the original NN and quantized NN is proposed to such that the structural information of data is preserved after quantization. The structural information learned from NN not only plays an important role in improving the performance but also allows for further fine tuning of the quantization network by applying the Lipschitz constraint to the structural loss. In addition, we also for the first time take the robustness of the quantized NN into consideration and propose a non-sensitive perturbation loss function by introducing an extraneous term of spectral norm. The experiments are conducted on CIFAR-10 and ImageNet datasets with popular NN( such as MoblieNetV2, SqueezeNet, ResNet20, etc). The experimental results show that the proposed algorithm is more competitive under 2-bit-precision than the state-of-the-art quantization methods. Meanwhile, the experimental results also demonstrate that the proposed method is robust under the FGSM adversarial samples attack.

下载PDF全文

下载文献需遵守相关版权规定

论文标题