论文标题
朝对抗训练的实用彩票票证假设
Towards Practical Lottery Ticket Hypothesis for Adversarial Training
论文作者
论文摘要
最近的研究提出了彩票假说,表明,对于深层神经网络,存在可训练的子网络,其表现均等或比具有相称训练步骤的原始模型相同或更好。尽管这一发现是有见地的,但找到适当的子网络需要迭代培训和修剪。高成本限制了彩票假设的应用。我们显示,存在上述子网络的一个子集,这些子网络在培训过程中汇聚的速度明显更快,因此可以减轻成本问题。我们进行了广泛的实验,以表明在各种模型结构中始终存在此类子网络,以实现超参数的限制设置(例如$,精心选择的学习率,修剪比率和模型容量)。作为我们发现的实际应用,我们证明了这样的子网络可以帮助减少对抗性训练的总时间,这是一种改善鲁棒性的标准方法,在CIFAR-10上最多可提高49%\%,以实现最先进的鲁棒性。
Recent research has proposed the lottery ticket hypothesis, suggesting that for a deep neural network, there exist trainable sub-networks performing equally or better than the original model with commensurate training steps. While this discovery is insightful, finding proper sub-networks requires iterative training and pruning. The high cost incurred limits the applications of the lottery ticket hypothesis. We show there exists a subset of the aforementioned sub-networks that converge significantly faster during the training process and thus can mitigate the cost issue. We conduct extensive experiments to show such sub-networks consistently exist across various model structures for a restrictive setting of hyperparameters ($e.g.$, carefully selected learning rate, pruning ratio, and model capacity). As a practical application of our findings, we demonstrate that such sub-networks can help in cutting down the total time of adversarial training, a standard approach to improve robustness, by up to 49\% on CIFAR-10 to achieve the state-of-the-art robustness.