论文标题
评估人群中虚假不匹配率差异的统计方法
Statistical Methods for Assessing Differences in False Non-Match Rates Across Demographic Groups
论文作者
论文摘要
从网络安全到边境安全的各种应用中都使用了生物识别。最近的研究集中在确保生物识别性能(假否定性和假阳性)之间是在人群中公平的。尽管指标的发展取得了重大进展,对跨组的绩效的评估以及缓解任何问题的评估,但几乎没有纳入统计差异的工作。这很重要,因为当不存在差异时,可以偶然发现群体之间的差异。在统计中,这称为I类错误。组之间的差异可能是由于抽样变化引起的,或者可能是由于系统性能的实际差异。区分这两个错误来源对于关于公平和公平的良好决策至关重要。本文介绍了两种新型统计方法,用于评估人群群体的公平性。第一种方法是一种基于自举的假设检验,而第二种方法是更简单的测试方法,以非统计受众的态度。对于后者,我们介绍了一项关于误差余量与受试者数量,尝试次数,尝试之间的相关性,基本的虚假匹配率(FNMR)和组数量之间的关系之间关系的模拟研究结果。
Biometric recognition is used across a variety of applications from cyber security to border security. Recent research has focused on ensuring biometric performance (false negatives and false positives) is fair across demographic groups. While there has been significant progress on the development of metrics, the evaluation of the performance across groups, and the mitigation of any problems, there has been little work incorporating statistical variation. This is important because differences among groups can be found by chance when no difference is present. In statistics this is called a Type I error. Differences among groups may be due to sampling variation or they may be due to actual difference in system performance. Discriminating between these two sources of error is essential for good decision making about fairness and equity. This paper presents two novel statistical approaches for assessing fairness across demographic groups. The first methodology is a bootstrapped-based hypothesis test, while the second is simpler test methodology focused upon non-statistical audience. For the latter we present the results of a simulation study about the relationship between the margin of error and factors such as number of subjects, number of attempts, correlation between attempts, underlying false non-match rates(FNMR's), and number of groups.