MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
Yandong Guo,Lei Zhang,Yuxiao Hu,Xiaodong He,Jianfeng Gao +4 more
- 08 Oct 2016
- pp 87-102
TL;DR: In this article, the authors proposed a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data.
read more
Abstract: In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and link them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide concrete measurement set, evaluation protocol, as well as training data. We also present in details our experiment setup and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Circle Loss: A Unified Perspective of Pair Similarity Optimization
TL;DR: The Circle loss is demonstrated, which has a unified formula for two elemental deep feature learning paradigms, learning with class-level labels and pair-wise labels, and the superiority of the Circle loss on a variety ofDeep feature learning tasks.
Range Loss for Deep Face Recognition with Long-Tailed Training Data
Xiao Zhang,Zhiyuan Fang,Yandong Wen,Zhifeng Li,Yu Qiao +4 more
- 01 Oct 2017
TL;DR: Zhang et al. as discussed by the authors investigated how long-tailed data impact the training of face CNNs and developed a novel loss function, called range loss, to effectively utilize the tailed data in training process.
Domain Generalization: A Survey
TL;DR: Domain generalization (DG) aims to achieve OOD generalization by using only source data for model learning as mentioned in this paper , which is a capability natural to humans yet challenging for machines to reproduce.
Learning Deep Representations with Probabilistic Knowledge Transfer
Nikolaos Passalis,Anastasios Tefas +1 more
- 08 Sep 2018
TL;DR: In this paper, a probabilistic knowledge transfer method that works by matching the probability distribution of the data in the feature space instead of their actual representation is proposed. But this method cannot be used efficiently for other representation learning tasks.
Universal Approximation Capability of Broad Learning System and Its Structural Variations
TL;DR: A mathematical proof of the universal approximation property of BLS is provided and the framework of several BLS variants with their mathematical modeling is given, which include cascade, recurrent, and broad–deep combination structures.
536
References
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky,Jia Deng,Hao Su,Jonathan Krause,Sanjeev Satheesh,Sean Ma,Zhiheng Huang,Andrej Karpathy,Aditya Khosla,Michael S. Bernstein,Alexander C. Berg,Li Fei-Fei +11 more
TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
FaceNet: A Unified Embedding for Face Recognition and Clustering
TL;DR: FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.
14.2K
•Proceedings Article
On Spectral Clustering: Analysis and an algorithm
Andrew Y. Ng,Michael I. Jordan,Yair Weiss +2 more
- 03 Jan 2001
TL;DR: A simple spectral clustering algorithm that can be implemented using a few lines of Matlab is presented, and tools from matrix perturbation theory are used to analyze the algorithm, and give conditions under which it can be expected to do well.
FaceNet: A unified embedding for face recognition and clustering
Florian Schroff,Dmitry Kalenichenko,James Philbin +2 more
- 07 Jun 2015
TL;DR: A system that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure offace similarity, and achieves state-of-the-art face recognition performance using only 128-bytes perface.
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Omkar M. Parkhi,Andrea Vedaldi,Andrew Zisserman +2 more
- 01 Jan 2015