论文标题

ARES:面向系统的战争游戏框架

Ares: A System-Oriented Wargame Framework for Adversarial ML

论文作者

Ahmed, Farhan, Vaishnavi, Pratik, Eykholt, Kevin, Rahmati, Amir

论文摘要

自从近十年前发现针对机器学习模型的对抗性攻击以来,对对抗性机器学习的研究已迅速发展成为防守者之间的永恒战争,后卫试图提高ML模型对对抗性攻击的鲁棒性,而对对抗性攻击的稳健性,他们试图发展能够削弱或击败这些防御力的更好的攻击。但是,这个领域几乎没有从ML从业人员获得的买入,他们既不关心影响现实世界中其系统的这些攻击,也不愿意以追求对这些攻击的鲁棒性来权衡其模型的准确性。 在本文中,我们激励ARES的设计和实施,ARES是对对抗性ML的评估框架,使研究人员能够在现实的战争游戏中探索攻击和防御。 Ares在具有相反目标的强化学习环境中,攻击者和后卫之间的冲突是两个代理。这允许引入系统级评估指标,例如时间失败和评估复杂策略,例如移动目标防御。我们提供了初步探索的结果,涉及对对抗训练的防守者的白盒攻击者。

Since the discovery of adversarial attacks against machine learning models nearly a decade ago, research on adversarial machine learning has rapidly evolved into an eternal war between defenders, who seek to increase the robustness of ML models against adversarial attacks, and adversaries, who seek to develop better attacks capable of weakening or defeating these defenses. This domain, however, has found little buy-in from ML practitioners, who are neither overtly concerned about these attacks affecting their systems in the real world nor are willing to trade off the accuracy of their models in pursuit of robustness against these attacks. In this paper, we motivate the design and implementation of Ares, an evaluation framework for adversarial ML that allows researchers to explore attacks and defenses in a realistic wargame-like environment. Ares frames the conflict between the attacker and defender as two agents in a reinforcement learning environment with opposing objectives. This allows the introduction of system-level evaluation metrics such as time to failure and evaluation of complex strategies such as moving target defenses. We provide the results of our initial exploration involving a white-box attacker against an adversarially trained defender.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源