对树团的有效对抗性攻击

论文标题

对树团的有效对抗性攻击

An Efficient Adversarial Attack for Tree Ensembles

论文作者

Zhang, Chong, Zhang, Huan, Hsieh, Cho-Jui

论文摘要

我们研究了对基于树的合奏的有效对抗性攻击的问题，例如梯度增强决策树（GBDTS）和随机森林（RFS）。由于这些模型是非连续的步骤函数，并且不存在梯度，因此大多数现有的有效对抗攻击不适用。尽管可以采用基于决策的黑盒攻击，但它们不能利用树木的特殊结构。在我们的工作中，我们将攻击问题转换为专门为树同团而设计的离散搜索问题，其目标是找到有效的“叶子元组”，导致错误分类，而与原始输入的距离最短。有了这种表述，我们表明，可以通过将叶子元组移至锤子距离内的叶子到附近的邻居1。在锤子距离1中，可以将一种简单而有效的贪婪算法应用于迭代地优化对抗性示例1。在几种大的GBDT和RF模型上进行实验结果，具有多达数百棵树的速度比以前的混合程序更较高（同时提供的较小的混合程序），这表明我们的方法更及时（既既富comperter）。示例比对一般$ \ ell_p $（$ p = 1，2，\ infty $）的常规式攻击的示例。我们的代码可在https://github.com/chong-z/tree-ensemble-attack上找到。

We study the problem of efficient adversarial attacks on tree based ensembles such as gradient boosting decision trees (GBDTs) and random forests (RFs). Since these models are non-continuous step functions and gradient does not exist, most existing efficient adversarial attacks are not applicable. Although decision-based black-box attacks can be applied, they cannot utilize the special structure of trees. In our work, we transform the attack problem into a discrete search problem specially designed for tree ensembles, where the goal is to find a valid "leaf tuple" that leads to mis-classification while having the shortest distance to the original input. With this formulation, we show that a simple yet effective greedy algorithm can be applied to iteratively optimize the adversarial example by moving the leaf tuple to its neighborhood within hamming distance 1. Experimental results on several large GBDT and RF models with up to hundreds of trees demonstrate that our method can be thousands of times faster than the previous mixed-integer linear programming (MILP) based approach, while also providing smaller (better) adversarial examples than decision-based black-box attacks on general $\ell_p$ ($p=1, 2, \infty$) norm perturbations. Our code is available at https://github.com/chong-z/tree-ensemble-attack.

下载PDF全文

下载文献需遵守相关版权规定

论文标题