Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning

Open Access

Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning

- 01 Mar 2020

pp 241-251

9

TL;DR: In this article, a multi-source transfer learning approach is proposed to obtain general models of knowledge for facial analysis, which consists in two successive training steps: the first one consists in applying a combination operator to define a common embedding for the multiple sources materialized by different existing trained models.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR46437.2021.00618

Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition

Jiahui She, +5 more

- 01 Apr 2021

TL;DR: DMUE as mentioned in this paper proposes an auxiliary multi-branch learning framework to better mine and describe the latent distribution in the label space, and the pairwise relationship of semantic feature between instances is fully exploited to estimate the ambiguity extent in the instance space.

...read moreread less

237

Journal Article•10.1002/ejp.1948

Artificial intelligence to evaluate postoperative pain based on facial expression recognition

Denys Fontaine, +10 more

- 30 Mar 2022

- European Journal of Pain

TL;DR: Pain intensity evaluation by self‐report is difficult and biased in non‐communicating people, which may contribute to inappropriate pain management.

...read moreread less

35

Book Chapter•10.1007/978-3-031-19778-9_7

Pre-training Strategies and Datasets for Facial Representation Learning

Adrian Bulat, +5 more

- 01 Jan 2022

- Lecture Notes in Computer Science

TL;DR: Pre-training strategies and datasets for facial representation learning are explored. Unsupervised pre-training and the impact of training datasets are investigated. Findings suggest that unsupervised pre-training and reduction of dataset redundancy are beneficial for facial representation learning.

...read moreread less

22

•Journal Article•10.1109/access.2023.3278100

Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation

01 Jan 2023

- IEEE Access

TL;DR: Wang et al. as mentioned in this paper presented a dynamic gesture recognition method named 3SCKI based on a three-stream coordinate attention (CA) network, knowledge distillation, and image-text contrastive learning.

...read moreread less

5

Journal Article•10.1109/ACCESS.2023.3278100

Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation

Shan Shan Wan, +2 more

- IEEE Access

TL;DR: Wang et al. as discussed by the authors presented a dynamic gesture recognition method named 3SCKI based on a three-stream coordinate attention (CA) network, knowledge distillation, and image-text contrastive learning.

...read moreread less

5

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

•Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

- 01 Jan 2014

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

...read moreread less

28.9K

•Posted Content

Distilling the Knowledge in a Neural Network

Geoffrey E. Hinton, +2 more

- 09 Mar 2015

- arXiv: Machine Learning

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.

...read moreread less

21.2K

•Proceedings Article•10.1109/CVPR.2015.7298682

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, +2 more

- 12 Mar 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

...read moreread less

14.2K

...

Expand

Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning

Chat with Paper

AI Agents for this Paper

Citations

Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition

Artificial intelligence to evaluate postoperative pain based on facial expression recognition

Pre-training Strategies and Datasets for Facial Representation Learning

Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation

Dynamic Gesture Recognition Based on Three-Stream Coordinate Attention Network and Knowledge Distillation

References

Deep Residual Learning for Image Recognition

ImageNet: A large-scale hierarchical image database

Auto-Encoding Variational Bayes

Distilling the Knowledge in a Neural Network

FaceNet: A Unified Embedding for Face Recognition and Clustering

Related Papers (5)

ProxylessKD: Direct Knowledge Distillation with Inherited Classifier for Face Recognition

Multi-view transfer learning with privileged learning framework

Transfer Learning with Ensemble of Multiple Feature Representations

Knowledge Distillation for End-to-End Person Search

Distilling Knowledge via Knowledge Review