Open Access
Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning
Valentin Vielzeuf,Alexis Lechervy,Stéphane Pateux,Frédéric Jurie +3 more
- 01 Mar 2020
pp 241-251
9
TL;DR: In this article, a multi-source transfer learning approach is proposed to obtain general models of knowledge for facial analysis, which consists in two successive training steps: the first one consists in applying a combination operator to define a common embedding for the multiple sources materialized by different existing trained models.
read more
Abstract: This paper proposes a step toward obtaining general models of knowledge for facial analysis, by addressing the question of multi-source transfer learning. More precisely, the proposed approach consists in two successive training steps: the first one consists in applying a combination operator to define a common embedding for the multiple sources materialized by different existing trained models. The proposed operator relies on an auto-encoder, trained on a large dataset, efficient both in terms of compression ratio and transfer learning performance. In a second step we exploit a distillation approach to obtain a lightweight student model mimicking the collection of the fused existing models. This model outperforms its teacher on novel tasks, achieving results on par with state-of-the-art methods on 15 facial analysis tasks (and domains), at an affordable training cost. Moreover, this student has 75 times less parameters than the original teacher and can be applied to a variety of novel face-related tasks.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition
Jiahui She,Yibo Hu,Hailin Shi,Jun Wang,Qiu Shen,Tao Mei +5 more
- 01 Apr 2021
TL;DR: DMUE as mentioned in this paper proposes an auxiliary multi-branch learning framework to better mine and describe the latent distribution in the label space, and the pairwise relationship of semantic feature between instances is fully exploited to estimate the ambiguity extent in the instance space.
Artificial intelligence to evaluate postoperative pain based on facial expression recognition
Denys Fontaine,Valentin Vielzeuf,Philippe Genestier,Pascal Limeux,Serena Santucci‐Sivilotto,Emmanuel Mory,Nelly Darmon,Michel Lanteri-Minet,May Mokhtar,Melanie Laine,Damien Vistoli +10 more
TL;DR: Pain intensity evaluation by self‐report is difficult and biased in non‐communicating people, which may contribute to inappropriate pain management.
35
Pre-training Strategies and Datasets for Facial Representation Learning
Adrian Bulat,Shiyang Cheng,Jing Yang,Andrew Garbett,Ernesto Sánchez Sánchez,Georgios Tzimiropoulos +5 more
TL;DR: Pre-training strategies and datasets for facial representation learning are explored. Unsupervised pre-training and the impact of training datasets are investigated. Findings suggest that unsupervised pre-training and reduction of dataset redundancy are beneficial for facial representation learning.
Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation
01 Jan 2023
TL;DR: Wang et al. as mentioned in this paper presented a dynamic gesture recognition method named 3SCKI based on a three-stream coordinate attention (CA) network, knowledge distillation, and image-text contrastive learning.
Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation
TL;DR: Wang et al. as discussed by the authors presented a dynamic gesture recognition method named 3SCKI based on a three-stream coordinate attention (CA) network, knowledge distillation, and image-text contrastive learning.
5
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
ImageNet: A large-scale hierarchical image database
Jia Deng,Wei Dong,Richard Socher,Li-Jia Li,Kai Li,Li Fei-Fei +5 more
- 20 Jun 2009
TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
•Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma,Max Welling +1 more
- 01 Jan 2014
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
•Posted Content
Distilling the Knowledge in a Neural Network
TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.
21.2K
FaceNet: A Unified Embedding for Face Recognition and Clustering
TL;DR: FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.
14.2K
Related Papers (5)
Hang Zhao,Qing Liu,Yun Yang +2 more
- 13 Jun 2018
Pengguang Chen,Shu Liu,Hengshuang Zhao,Jiaya Jia +3 more
- 20 Jun 2021