切片调谐器：用于准确且公平的机器学习模型的选择性数据采集框架

论文标题

切片调谐器：用于准确且公平的机器学习模型的选择性数据采集框架

Slice Tuner: A Selective Data Acquisition Framework for Accurate and Fair Machine Learning Models

论文作者

Tae, Ki Hyun, Whang, Steven Euijong

论文摘要

随着机器学习在软件2.0时代的民主化，严重的瓶颈正在获取足够的数据以确保准确且公平的模型。包括众包在内的最新技术提供了收集此类数据的经济高效方法。但是，尽可能多地获取数据并不一定是优化准确性和公平性的有效策略。例如，如果在线应用商店有足够的培训数据来用于某些片段的数据（例如美国客户），但不适合其他数据，获得更多的美国客户数据只会偏向模型培训。取而代之的是，我们认为需要选择性获取数据并提出切片调谐器，该切片调谐器每片获取可能不同量的数据，以便优化所有切片的模型准确性和公平性。此问题与对现有数据（在主动学习或弱监督下）标记标签不同，因为目标是获得适量的新数据。 Slice调谐器的核心保持了切片的学习曲线，这些曲线估计了具有更多数据的模型精度，并使用凸优化来找到最佳的数据采集策略。估计学习曲线的主要挑战是，如果数据不足，它们可能不准确，并且在切片之间可能存在依赖性，其中获取一个切片的数据会影响他人的学习曲线。我们通过迭代，有效地更新学习曲线来解决这些问题，随着获得更多数据的获取。我们使用众包进行数据采集来评估实际数据集上的切片调谐器，并表明切片调谐器在模型的准确性和公平性方面显着优于基准，即使无法可靠地估计学习曲线。

As machine learning becomes democratized in the era of Software 2.0, a serious bottleneck is acquiring enough data to ensure accurate and fair models. Recent techniques including crowdsourcing provide cost-effective ways to gather such data. However, simply acquiring data as much as possible is not necessarily an effective strategy for optimizing accuracy and fairness. For example, if an online app store has enough training data for certain slices of data (say American customers), but not for others, obtaining more American customer data will only bias the model training. Instead, we contend that one needs to selectively acquire data and propose Slice Tuner, which acquires possibly-different amounts of data per slice such that the model accuracy and fairness on all slices are optimized. This problem is different than labeling existing data (as in active learning or weak supervision) because the goal is obtaining the right amounts of new data. At its core, Slice Tuner maintains learning curves of slices that estimate the model accuracies given more data and uses convex optimization to find the best data acquisition strategy. The key challenges of estimating learning curves are that they may be inaccurate if there is not enough data, and there may be dependencies among slices where acquiring data for one slice influences the learning curves of others. We solve these issues by iteratively and efficiently updating the learning curves as more data is acquired. We evaluate Slice Tuner on real datasets using crowdsourcing for data acquisition and show that Slice Tuner significantly outperforms baselines in terms of model accuracy and fairness, even when the learning curves cannot be reliably estimated.

下载PDF全文

下载文献需遵守相关版权规定

论文标题