朝对抗训练的实用彩票票证假设

论文标题

朝对抗训练的实用彩票票证假设

Towards Practical Lottery Ticket Hypothesis for Adversarial Training

论文作者

Li, Bai, Wang, Shiqi, Jia, Yunhan, Lu, Yantao, Zhong, Zhenyu, Carin, Lawrence, Jana, Suman

论文摘要

最近的研究提出了彩票假说，表明，对于深层神经网络，存在可训练的子网络，其表现均等或比具有相称训练步骤的原始模型相同或更好。尽管这一发现是有见地的，但找到适当的子网络需要迭代培训和修剪。高成本限制了彩票假设的应用。我们显示，存在上述子网络的一个子集，这些子网络在培训过程中汇聚的速度明显更快，因此可以减轻成本问题。我们进行了广泛的实验，以表明在各种模型结构中始终存在此类子网络，以实现超参数的限制设置（例如$，精心选择的学习率，修剪比率和模型容量）。作为我们发现的实际应用，我们证明了这样的子网络可以帮助减少对抗性训练的总时间，这是一种改善鲁棒性的标准方法，在CIFAR-10上最多可提高49％\％，以实现最先进的鲁棒性。

Recent research has proposed the lottery ticket hypothesis, suggesting that for a deep neural network, there exist trainable sub-networks performing equally or better than the original model with commensurate training steps. While this discovery is insightful, finding proper sub-networks requires iterative training and pruning. The high cost incurred limits the applications of the lottery ticket hypothesis. We show there exists a subset of the aforementioned sub-networks that converge significantly faster during the training process and thus can mitigate the cost issue. We conduct extensive experiments to show such sub-networks consistently exist across various model structures for a restrictive setting of hyperparameters ($e.g.$, carefully selected learning rate, pruning ratio, and model capacity). As a practical application of our findings, we demonstrate that such sub-networks can help in cutting down the total time of adversarial training, a standard approach to improve robustness, by up to 49\% on CIFAR-10 to achieve the state-of-the-art robustness.

下载PDF全文

下载文献需遵守相关版权规定

论文标题