Book Chapter10.1007/978-3-642-15567-3_38
Robust head pose estimation using supervised manifold learning
Chiraz BenAbdelkader
- 05 Sep 2010
- pp 518-531
78
TL;DR: This paper presents a taxonomy of methods for incorporating continuous pose angle information into one or more stages of the manifold learning process, and discusses its implementation for Neighborhood Preserving Embedding and Locality Preserving Projection.
read more
Abstract: We address the problem of fine-grain head pose angle estimation from a single 2D face image as a continuous regression problem. Currently the state of the art, and a promising line of research, on head pose estimation seems to be that of nonlinear manifold embedding techniques, which learn an "optimal" low-dimensional manifold that models the nonlinear and continuous variation of face appearance with pose angle. Furthermore, supervised manifold learning techniques attempt to achieve this robustly in the presence of latent variables in the training set (especially identity, illumination, and facial expression), by incorporating head pose angle information accompanying the training samples. Most of these techniques are designed with the classification scenario in mind, however, and are not directly applicable to the regression scenario where continuous numeric values (pose angles), rather than class labels (discrete poses), are available. In this paper, we propose to deal with the regression case in a principled way. We present a taxonomy of methods for incorporating continuous pose angle information into one or more stages of the manifold learning process, and discuss its implementation for Neighborhood Preserving Embedding (NPE) and Locality Preserving Projection (LPP). Experiments are carried out on a face dataset containing significant identity and illumination variations, and the results show that our regression-based approach far outperforms previous supervised manifold learning methods for head pose estimation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Multimodal Face-Pose Estimation With Multitask Manifold Deep Learning
TL;DR: A novel face-pose estimation framework named multitask manifold deep learning, based on feature extraction with improved convolutional neural networks (CNNs) and multimodal mapping relationship with multitask learning is proposed.
Deep Head Pose: Gaze-Direction Estimation in Multimodal Video
TL;DR: A convolutional neural network (CNN)-based model for human head pose estimation in low-resolution multi-modal RGB-D data is presented and it is shown that many higher level scene understanding like human-human/scene interaction detection can be achieved.
217
Multimodal Face-Pose Estimation With Multitask Manifold Deep Learning
TL;DR: A novel face-pose estimation framework named multitask manifold deep learning, based on feature extraction with improved convolutional neural networks (CNNs) and multimodal mapping relationship with multitask learning is proposed.
100
Robust Head-Pose Estimation Based on Partially-Latent Mixture of Linear Regressions
TL;DR: In this article, a mixture of linear regressions with partially-latent output is used to map high-dimensional feature vectors (extracted from bounding boxes of faces) onto the joint space of head-pose angles and bounding-box shifts, such that they are robustly predicted in the presence of unobservable phenomena.
88
Improving head pose estimation using two-stage ensembles with top-k regression
TL;DR: A novel head pose estimation method using two-stage ensembles with average top-k regression based on the former prediction and considering the task-dependent weights instead of setting coefficients by grid search is introduced.
66
References
Nonlinear dimensionality reduction by locally linear embedding.
Sam T. Roweis,Lawrence K. Saul +1 more
TL;DR: Locally linear embedding (LLE) is introduced, an unsupervised learning algorithm that computes low-dimensional, neighborhood-preserving embeddings of high-dimensional inputs that learns the global structure of nonlinear manifolds.
A global geometric framework for nonlinear dimensionality reduction.
TL;DR: An approach to solving dimensionality reduction problems that uses easily measured local metric information to learn the underlying global geometry of a data set and efficiently computes a globally optimal solution, and is guaranteed to converge asymptotically to the true structure.
Laplacian Eigenmaps for dimensionality reduction and data representation
Mikhail Belkin,Partha Niyogi +1 more
TL;DR: In this article, the authors proposed a geometrically motivated algorithm for representing high-dimensional data, based on the correspondence between the graph Laplacian, the Laplace Beltrami operator on the manifold and the connections to the heat equation.
•Proceedings Article
Locality Preserving Projections
Xiaofei He,Partha Niyogi +1 more
- 09 Dec 2003
TL;DR: These are linear projective maps that arise by solving a variational problem that optimally preserves the neighborhood structure of the data set by finding the optimal linear approximations to the eigenfunctions of the Laplace Beltrami operator on the manifold.
Face recognition using Laplacianfaces
TL;DR: Experimental results suggest that the proposed Laplacianface approach provides a better representation and achieves lower error rates in face recognition.