如何通过Stylegan提高面部识别？

论文标题

如何通过Stylegan提高面部识别？

How to Boost Face Recognition with StyleGAN?

论文作者

Sevastopolsky, Artem, Malkov, Yury, Durasov, Nikita, Verdoliva, Luisa, Nießner, Matthias

论文摘要

最先进的面部识别系统需要大量标记的培训数据。鉴于面部识别应用程序中隐私的优先级，数据仅限于名人网络爬网，这些爬网的身份数量有限。另一方面，行业中的自我监督革命激发了对相关技术对面部识别的适应的研究。最受欢迎的实用技巧之一是通过从生成模型中获取的样本在保留身份的同时增强数据集。我们表明，一种基于微调PSP编码器的简单方法使我们能够改善最先进的面部识别，并且与培训合成面部身份相比，我们的表现更好。我们还收集具有可控的种族宪法的大规模的未标记数据集-AfricanFaceset-5M（500万张不同人的图像）和Asianfaceset-3M（300万个不同人的图像） - 我们表明，对每个人进行预估计，可以改善对各自的种族的认识（以及其他人），同时将所有无标记的数据列表结合在一起。我们的自我监督策略是最有用的，具有有限的标记培训数据，这对于更量身定制的面部识别任务以及面对隐私问题可能是有益的。评估基于标准的RFW数据集和新的大型RB-Webface基准。代码和数据可在https://github.com/seva100/stylegan-for-facerec上公开获得。

State-of-the-art face recognition systems require vast amounts of labeled training data. Given the priority of privacy in face recognition applications, the data is limited to celebrity web crawls, which have issues such as limited numbers of identities. On the other hand, self-supervised revolution in the industry motivates research on the adaptation of related techniques to facial recognition. One of the most popular practical tricks is to augment the dataset by the samples drawn from generative models while preserving the identity. We show that a simple approach based on fine-tuning pSp encoder for StyleGAN allows us to improve upon the state-of-the-art facial recognition and performs better compared to training on synthetic face identities. We also collect large-scale unlabeled datasets with controllable ethnic constitution -- AfricanFaceSet-5M (5 million images of different people) and AsianFaceSet-3M (3 million images of different people) -- and we show that pretraining on each of them improves recognition of the respective ethnicities (as well as others), while combining all unlabeled datasets results in the biggest performance increase. Our self-supervised strategy is the most useful with limited amounts of labeled training data, which can be beneficial for more tailored face recognition tasks and when facing privacy concerns. Evaluation is based on a standard RFW dataset and a new large-scale RB-WebFace benchmark. The code and data are made publicly available at https://github.com/seva100/stylegan-for-facerec.

下载PDF全文

下载文献需遵守相关版权规定

论文标题