论文标题
并非所有功能都是平等的:发现保存预测隐私的基本功能
Not All Features Are Equal: Discovering Essential Features for Preserving Prediction Privacy
论文作者
论文摘要
从云中接收机器学习服务时,提供商不需要接收所有功能;实际上,目标预测任务只需要一个子集。辨别此子集是这项工作的关键问题。我们将此问题提出为基于梯度的扰动最大化方法,该方法在提供商使用的预测模型的功能方面发现了输入特征空间中的该子集。在识别子集后,我们的框架斗篷使用实用程序的恒定值抑制其余功能,这些恒定值通过基于单独的梯度优化过程发现。我们表明,斗篷不一定需要服务提供商的正常服务的协作,并且可以在我们只能对服务提供商模型的黑箱访问的情况下应用。从理论上讲,我们保证披风的优化减少了数据和发送的筛选表示之间的相互信息(MI)的上限。实验结果表明,斗篷将输入和筛分表示之间的相互信息降低了85.01%,而效用只有可忽略不计(1.42%)。此外,我们表明斗篷大大降低了对手学习和推断出无能力的功能的能力。
When receiving machine learning services from the cloud, the provider does not need to receive all features; in fact, only a subset of the features are necessary for the target prediction task. Discerning this subset is the key problem of this work. We formulate this problem as a gradient-based perturbation maximization method that discovers this subset in the input feature space with respect to the functionality of the prediction model used by the provider. After identifying the subset, our framework, Cloak, suppresses the rest of the features using utility-preserving constant values that are discovered through a separate gradient-based optimization process. We show that Cloak does not necessarily require collaboration from the service provider beyond its normal service, and can be applied in scenarios where we only have black-box access to the service provider's model. We theoretically guarantee that Cloak's optimizations reduce the upper bound of the Mutual Information (MI) between the data and the sifted representations that are sent out. Experimental results show that Cloak reduces the mutual information between the input and the sifted representations by 85.01% with only a negligible reduction in utility (1.42%). In addition, we show that Cloak greatly diminishes adversaries' ability to learn and infer non-conducive features.