论文标题
受限统计学习的经验双重性差距
The empirical duality gap of constrained statistical learning
论文作者
论文摘要
本文涉及对受限统计学习问题的研究,其不受约束的版本实际上是所有现代信息处理的核心。但是,对约束的核算对于纳入先验知识并在解决方案上施加了所需的结构和统计特性至关重要。尽管如此,解决受限的统计问题仍然具有挑战性并确保稀缺,使它们使用正规化的配方进行解决。尽管实用有效,但是选择正则化参数以满足要求,如果可能的话,由于参数和约束之间缺乏直接的关系,因此具有挑战性。在这项工作中,我们建议通过利用有限维度参数化,样本平均值和二元性理论来直接解决克服其无限维度,未知分布和约束的约束统计问题。除了解决问题外,这些工具还允许我们限制经验二元性差距,即我们近似可牵引的解决方案与原始统计问题的实际解决方案之间的差异。我们证明了在公平学习应用中这种受约束的公式的有效性和实用性。
This paper is concerned with the study of constrained statistical learning problems, the unconstrained version of which are at the core of virtually all of modern information processing. Accounting for constraints, however, is paramount to incorporate prior knowledge and impose desired structural and statistical properties on the solutions. Still, solving constrained statistical problems remains challenging and guarantees scarce, leaving them to be tackled using regularized formulations. Though practical and effective, selecting regularization parameters so as to satisfy requirements is challenging, if at all possible, due to the lack of a straightforward relation between parameters and constraints. In this work, we propose to directly tackle the constrained statistical problem overcoming its infinite dimensionality, unknown distributions, and constraints by leveraging finite dimensional parameterizations, sample averages, and duality theory. Aside from making the problem tractable, these tools allow us to bound the empirical duality gap, i.e., the difference between our approximate tractable solutions and the actual solutions of the original statistical problem. We demonstrate the effectiveness and usefulness of this constrained formulation in a fair learning application.