论文标题
用脱钩的正规器利用硬混合样品
Harnessing Hard Mixed Samples with Decoupled Regularizer
论文作者
论文摘要
混音是一种有效的数据增强方法,它通过使用混合数据来平滑决策边界来改善神经网络的概括。最近,动态混合方法通过最大化混合样品中与目标相关的明显区域有效改善了先前的静态策略(例如,线性插值),但过多的额外时间成本是不可接受的。这些其他计算开销主要来自根据混合标签优化混合样品。但是,我们发现额外的优化步骤可能是多余的,因为标签不匹配的混合样品是深层模型的信息性混合样品,以定位判别特征。因此,在本文中,我们不是试图提出一个更复杂的动态混合策略,而是使用名为“脱钩混合”(DM)的脱钩正规器(DM)的有效混合目标函数。主要效果是DM可以自适应地利用这些硬混合样品来开采判别特征,而不会失去混合的原始光滑度。结果,DM使静态混合方法能够实现可比较甚至超过动态方法的性能,而无需任何额外的计算。这也导致了一个有趣的客观设计问题,用于混合培训,我们需要专注于平滑决策边界并确定歧视性特征。关于七个数据集的监督和半监督学习基准测试的广泛实验验证了DM作为插件模块的有效性。源代码和型号可从https://github.com/westlake-ai/openmixup获得
Mixup is an efficient data augmentation approach that improves the generalization of neural networks by smoothing the decision boundary with mixed data. Recently, dynamic mixup methods have improved previous static policies effectively (e.g., linear interpolation) by maximizing target-related salient regions in mixed samples, but excessive additional time costs are not acceptable. These additional computational overheads mainly come from optimizing the mixed samples according to the mixed labels. However, we found that the extra optimizing step may be redundant because label-mismatched mixed samples are informative hard mixed samples for deep models to localize discriminative features. In this paper, we thus are not trying to propose a more complicated dynamic mixup policy but rather an efficient mixup objective function with a decoupled regularizer named Decoupled Mixup (DM). The primary effect is that DM can adaptively utilize those hard mixed samples to mine discriminative features without losing the original smoothness of mixup. As a result, DM enables static mixup methods to achieve comparable or even exceed the performance of dynamic methods without any extra computation. This also leads to an interesting objective design problem for mixup training that we need to focus on both smoothing the decision boundaries and identifying discriminative features. Extensive experiments on supervised and semi-supervised learning benchmarks across seven datasets validate the effectiveness of DM as a plug-and-play module. Source code and models are available at https://github.com/Westlake-AI/openmixup