煤矿中的金丝雀：更好的会员推断与结合的对抗性疑问

论文标题

煤矿中的金丝雀：更好的会员推断与结合的对抗性疑问

Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries

论文作者

Wen, Yuxin, Bansal, Arpit, Kazemi, Hamid, Borgnia, Eitan, Goldblum, Micah, Geiping, Jonas, Goldstein, Tom

论文摘要

随着机器学习模型越来越多地自动化工业应用程序，执行个人数据所有权和知识产权要求将培训数据追溯到其应有的所有者。会员推断算法通过使用统计技术来辨别模型训练集是否包括目标样本来解决此问题。但是，现有方法仅利用未更改的目标样本或目标的简单增强来计算统计。对模型行为的稀疏抽样几乎没有信息，从而导致推理能力差。在这项工作中，我们使用对抗性工具直接优化歧视性和多样性的查询。与现有方法相比，我们的改进获得了更准确的会员推论，尤其是在离线场景和低阳性制度中，这在法律环境中至关重要。代码可在https://github.com/yuxinwenrick/canary-in--a-coalmine上找到。

As industrial applications are increasingly automated by machine learning models, enforcing personal data ownership and intellectual property rights requires tracing training data back to their rightful owners. Membership inference algorithms approach this problem by using statistical techniques to discern whether a target sample was included in a model's training set. However, existing methods only utilize the unaltered target sample or simple augmentations of the target to compute statistics. Such a sparse sampling of the model's behavior carries little information, leading to poor inference capabilities. In this work, we use adversarial tools to directly optimize for queries that are discriminative and diverse. Our improvements achieve significantly more accurate membership inference than existing methods, especially in offline scenarios and in the low false-positive regime which is critical in legal settings. Code is available at https://github.com/YuxinWenRick/canary-in-a-coalmine.

下载PDF全文

下载文献需遵守相关版权规定

论文标题