WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

Zhu, Zheng; Huang, Guan; Deng, Jiankang; Ye, Yun; Huang, Junjie; Chen, Xinze; Zhu, Jiagang; Yang, Tian; Lu, Jiwen; Du, Dalong; Zhou, Jie

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.04098 (cs)

[Submitted on 6 Mar 2021]

Title:WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

Authors:Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou

View PDF

Abstract:In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol. Firstly, we collect 4M name list and download 260M faces from the Internet. Then, a Cleaning Automatically utilizing Self-Training (CAST) pipeline is devised to purify the tremendous WebFace260M, which is efficient and scalable. To the best of our knowledge, the cleaned WebFace42M is the largest public face recognition training set and we expect to close the data gap between academia and industry. Referring to practical scenarios, Face Recognition Under Inference Time conStraint (FRUITS) protocol and a test set are constructed to comprehensively evaluate face matchers.
Equipped with this benchmark, we delve into million-scale face recognition problems. A distributed framework is developed to train face recognition models efficiently without tampering with the performance. Empowered by WebFace42M, we reduce relative 40% failure rate on the challenging IJB-C set, and ranks the 3rd among 430 entries on NIST-FRVT. Even 10% data (WebFace4M) shows superior performance compared with public training set. Furthermore, comprehensive baselines are established on our rich-attribute test set under FRUITS-100ms/500ms/1000ms protocol, including MobileNet, EfficientNet, AttentionNet, ResNet, SENet, ResNeXt and RegNet families. Benchmark website is this https URL.

Comments:	Accepted by CVPR2021. Benchmark website is this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.04098 [cs.CV]
	(or arXiv:2103.04098v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.04098

Submission history

From: Zheng Zhu [view email]
[v1] Sat, 6 Mar 2021 11:12:43 UTC (3,632 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators