MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Open Access • Posted Content

- 27 Jul 2016

- arXiv: Computer Vision and Pattern Recog...

1.3K

TL;DR: A benchmark task to recognize one million celebrities from their face images, using all the face images of each individual that can be collected from the web as training data, which could lead to one of the largest classification problems in computer vision.

Figures

Fig. 1. An example of our face recognition task. Our task is to recognize the face in the image and then link this face with the corresponding entity key in the knowledge base. By recognizing the face in the left image as “Anne Hathaway” and linking it to the entity key, we know she is an American actress born in 1982, who played Mia Thermopolis in The Princess Diaries, not the other Anne Hathaway who was the wife of William Shakespeare. Input image is from the web.

Fig. 2. Distribution of the properties of the celebrities in our one-million list in different aspects. The large scale of our dataset naturally introduces great diversity. As shown in (a) and (b), we include persons with more than 2000 different professions, who come from more than 200 distinct countries/regions. Figure (c) shows that we do not include celebrities who were born before 1846 (long before the first roll-film camera, the “Kodak”, was invented [19]) and that the list covers celebrities across a wide range of ages. In (d), we notice that we have more females than males in our one-million celebrity list. This might be correlated with the profession distribution in our list.

Fig. 3. Labeling GUI for “Chuck Palahniuk” (partial view). As shown in the figure, a representative image and a short description are provided in the upper right corner. For a given image candidate, a judge can label it as “not for this celebrity” (red), “yes for this celebrity” (green), or “broken image” (dark gray).

Fig. 4. Examples (subset) of the training images for the celebrity with entity key m.06y3r (Steve Jobs). The image marked with a green rectangle is claimed to be Steve Jobs when he was in high school. The image marked with a red rectangle is considered a noise sample in our dataset, since it is synthesized by combining one image of Steve Jobs and one image of Ashton Kutcher, who played him in the movie “Jobs”.
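
The linking step described in the Fig. 1 caption can be illustrated with a minimal Python sketch. It is not the authors' implementation: the entity keys `example.key.a` and `example.key.b`, the `Entity` record, and both helper functions are hypothetical, and the recognizer's output is replaced by a hard-coded key. It only shows why predicting an entity key, rather than a name string, resolves the “Anne Hathaway” ambiguity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    key: str          # unique knowledge-base identifier (hypothetical values below)
    name: str         # display name; may be shared by several entities
    description: str  # facts attached to the entity


# Toy knowledge base with two entities that share one name.
# The keys are placeholders, not real knowledge-base identifiers.
KNOWLEDGE_BASE = {
    "example.key.a": Entity(
        "example.key.a",
        "Anne Hathaway",
        "American actress born in 1982; played Mia Thermopolis in The Princess Diaries",
    ),
    "example.key.b": Entity(
        "example.key.b",
        "Anne Hathaway",
        "wife of William Shakespeare",
    ),
}


def entities_named(name: str) -> list[str]:
    """Return the keys of all entities with this name (shows the ambiguity)."""
    return sorted(e.key for e in KNOWLEDGE_BASE.values() if e.name == name)


def link(predicted_key: str) -> Entity:
    """Look up the entity for a key predicted by a face recognizer."""
    return KNOWLEDGE_BASE[predicted_key]


# A name alone matches two entities; a predicted key selects exactly one.
ambiguous = entities_named("Anne Hathaway")
linked = link("example.key.a")  # stand-in for a recognizer's output
print(ambiguous)
print(linked.description)
```

In this sketch the classifier's label space would be the set of entity keys, so two people with the same name remain distinct classes.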

Citations

•Posted Content

Clustering based Contrastive Learning for Improving Face Representations

Vivek Sharma, +3 more

- 05 Apr 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents Clustering-based Contrastive Learning (CCL), a new clustering-based representation learning approach that uses labels obtained from clustering along with video constraints to learn discriminative face features.

38

•Proceedings Article•10.1109/ICCVW54120.2021.00458

Rethinking Common Assumptions to Mitigate Racial Bias in Face Recognition Datasets

Matthew Gwilliam, +3 more

- 01 Oct 2021

TL;DR: The authors showed that training on only African faces induced less bias than training on a balanced distribution of faces, that distributions skewed to include more African faces produced more equitable models, and that adding more images of existing identities to a dataset in place of adding new identities can lead to accuracy boosts across racial categories.

38

•Proceedings Article•10.1109/ICIP40778.2020.9190643

Learning Discriminative Representation For Facial Expression Recognition From Uncertainties

Xingyu Fan, +4 more

- 01 Oct 2020

TL;DR: Novel Rayleigh and weighted-softmax losses are introduced to extract discriminative representations, and a weight is introduced to measure the uncertainty of a given sample by considering its distance to the class center.

38

•Journal Article•10.48550/arXiv.2211.06627

MARLIN: Masked Autoencoder for facial video Representation LearnINg

Zhi Cai, +7 more

- 12 Nov 2022

- arXiv.org

TL;DR: In this paper, a self-supervised approach is proposed to learn universal facial representations from videos that can transfer across a variety of facial analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS).

37

•Journal Article•10.1049/BME2.12046

Facial masks and soft‐biometrics: Leveraging face recognition CNNs for age and gender prediction on mobile ocular images

Fernando Alonso-Fernandez, +4 more

- 01 Sep 2021

- IET Biometrics

TL;DR: A comprehensive study of the effects of different pre-training over the employed architectures is carried out, showing that, in most cases, a better accuracy is obtained after the networks have been fine-tuned for face recognition.

37

References

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

98.2K

•Journal Article•10.1007/S11263-015-0816-Y

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015

- International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is a benchmark in object category classification and detection on hundreds of object categories and millions of images. It has been run annually from 2010 to present, attracting participation from more than fifty institutions.

41.6K

•Journal Article

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Apr 2015

- Springer US

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) has been running annually for five years (since 2010) and has become the standard benchmark for large-scale object recognition.

23.9K

•Proceedings Article•10.1109/CVPR.2015.7298682

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, +2 more

- 12 Mar 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: FaceNet uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

14.2K

•Proceedings Article

On Spectral Clustering: Analysis and an algorithm

Andrew Y. Ng, +2 more

- 03 Jan 2001

TL;DR: A simple spectral clustering algorithm that can be implemented using a few lines of Matlab is presented, and tools from matrix perturbation theory are used to analyze the algorithm, and give conditions under which it can be expected to do well.

10.4K

Related Papers (5)

Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments

Deep Residual Learning for Image Recognition

Deep face recognition

FaceNet: A unified embedding for face recognition and clustering

DeepFace: Closing the Gap to Human-Level Performance in Face Verification