论文标题
倾斜的经验风险最小化
Tilted Empirical Risk Minimization
论文作者
论文摘要
经验风险最小化(ERM)通常设计为在平均损失上表现良好,这可能导致估计值对异常值敏感,概括或不公平地对待亚组。尽管许多方法旨在单独解决这些问题,但在这项工作中,我们通过统一的框架探索它们 - 倾斜的经验风险最小化(术语)。特别是,我们表明,可以使用称为倾斜的超参数通过直接扩展到ERM的直接扩展来灵活调整单个损失的影响。我们提供了对由此产生的框架的几种解释:我们表明术语可以分别增加或减少异常值的影响,以实现公平或稳健性;具有减少差异的属性,可以使概括受益;并且可以看作是对超品牌方法的平滑近似。我们开发了用于解决项的批处理和随机的一阶优化方法,并证明该问题可以相对于共同替代方案有效解决。最后,我们证明可以将术语用于多种应用,例如在子组之间执行公平性,减轻离群值的效果以及处理类失衡。术语不仅与针对这些个人问题量身定制的现有解决方案竞争,而且还可以启用全新的应用程序,例如同时解决异常值和促进公平。
Empirical risk minimization (ERM) is typically designed to perform well on the average loss, which can result in estimators that are sensitive to outliers, generalize poorly, or treat subgroups unfairly. While many methods aim to address these problems individually, in this work, we explore them through a unified framework -- tilted empirical risk minimization (TERM). In particular, we show that it is possible to flexibly tune the impact of individual losses through a straightforward extension to ERM using a hyperparameter called the tilt. We provide several interpretations of the resulting framework: We show that TERM can increase or decrease the influence of outliers, respectively, to enable fairness or robustness; has variance-reduction properties that can benefit generalization; and can be viewed as a smooth approximation to a superquantile method. We develop batch and stochastic first-order optimization methods for solving TERM, and show that the problem can be efficiently solved relative to common alternatives. Finally, we demonstrate that TERM can be used for a multitude of applications, such as enforcing fairness between subgroups, mitigating the effect of outliers, and handling class imbalance. TERM is not only competitive with existing solutions tailored to these individual problems, but can also enable entirely new applications, such as simultaneously addressing outliers and promoting fairness.