论文标题
TED谈话等级中的偏见检测和缓解
Detection and Mitigation of Bias in Ted Talk Ratings
论文作者
论文摘要
公正的数据收集对于保证人工智能模型的公平性至关重要。隐性偏见,一种行为条件形式,使我们将预定的特征归因于某些组成员并告知数据收集过程。本文量化了TedTalks的观众评分的隐性偏见,这是一个评估社会和专业表现的多样化的社交平台,以表明跨敏感属性的各种偏见的相关性。尽管这些视频的观众评分应纯粹反映了说话者的能力和技巧,但我们对评分的分析表明,在种族和性别方面存在压倒性和主要的隐性偏见。在我们的论文中,我们提出了检测和减轻偏见的策略,这些偏见对于消除AI中的不公平至关重要。
Unbiased data collection is essential to guaranteeing fairness in artificial intelligence models. Implicit bias, a form of behavioral conditioning that leads us to attribute predetermined characteristics to members of certain groups and informs the data collection process. This paper quantifies implicit bias in viewer ratings of TEDTalks, a diverse social platform assessing social and professional performance, in order to present the correlations of different kinds of bias across sensitive attributes. Although the viewer ratings of these videos should purely reflect the speaker's competence and skill, our analysis of the ratings demonstrates the presence of overwhelming and predominant implicit bias with respect to race and gender. In our paper, we present strategies to detect and mitigate bias that are critical to removing unfairness in AI.