Open AccessProceedings Article
Conditional Visual Tracking in Kernel Space
Cristian Sminchisescu,Atul Kanujia,Zhiguo Li,Dimitris N. Metaxas +3 more
- 05 Dec 2005
- Vol. 18, pp 1249-1256
TL;DR: This work combines kernel PCA-based non-linear dimensionality reduction (kPCA) and Conditional Bayesian Mixture of Experts (BME) in order to learn complex multivalued predictors between observations and model hidden states for accurate, inverse, visual perception inferences.
read more
Abstract: We present a conditional temporal probabilistic framework for reconstructing 3D human motion in monocular video based on descriptors encoding image silhouette observations. For computational efficiency we restrict visual inference to low-dimensional kernel induced non-linear state spaces. Our methodology (kBME) combines kernel PCA-based non-linear dimensionality reduction (kPCA) and Conditional Bayesian Mixture of Experts (BME) in order to learn complex multivalued predictors between observations and model hidden states. This is necessary for accurate, inverse, visual perception inferences, where several probable, distant 3D solutions exist due to noise or the uncertainty of monocular perspective projection. Low-dimensional models are appropriate because many visual processes exhibit strong non-linear correlations in both the image observations and the target, hidden state variables. The learned predictors are temporally combined within a conditional graphical model in order to allow a principled propagation of uncertainty. We study several predictors and empirically show that the proposed algorithm positively compares with techniques based on regression, Kernel Dependency Estimation (KDE) or PCA alone, and gives results competitive to those of high-dimensional mixture predictors at a fraction of their computational cost. We show that the method successfully reconstructs the complex 3D motion of humans in real monocular video sequences.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Twin Gaussian Processes for Structured Prediction
Liefeng Bo,Cristian Sminchisescu +1 more
TL;DR: Twin Gaussian processes (TGP), a generic structured prediction method that uses Gaussian process priors on both covariates and responses, both multivariate, and estimates outputs by minimizing the Kullback-Leibler divergence between two GP modeled as normal distributions over finite index sets of training and testing examples, is described.
The Joint Manifold Model for Semi-supervised Multi-valued Regression
Ramanan Navaratnam,Andrew Fitzgibbon,Roberto Cipolla +2 more
- 26 Dec 2007
TL;DR: A Gaussian process latent variable model is used to learn the mapping from a shared latent low-dimensional manifold to the feature and parameter spaces and Experiments on synthetic and real problems demonstrate how the use of unlabelled data improves over existing techniques.
BM³E : Discriminative Density Propagation for Visual Tracking
TL;DR: The research establishes the density propagation rules for discriminative inference in continuous, temporal chain models and proposes flexible supervised and unsupervised algorithms to learn feed-forward, multivalued contextual mappings based on compact, conditional Bayesian mixture of experts models.
Monocular 3D tracking of articulated human motion in silhouette and pose manifolds
TL;DR: This paper presents a robust computational framework for monocular 3D tracking of human movement by constructing low-dimensional silhouettes and poses manifolds, establishing intermanifold mappings, and performing tracking in such manifolds using a particle filter.
•Journal Article
Tracking facial features using mixture of point distribution models
TL;DR: The contribution is to apply ASM to the task of tracking shapes involving wide aspect changes and generic movements by incorporating shape priors that are learned over non-linear shape space and using them to learn the plausible shape space.
27
References
Nonlinear component analysis as a kernel eigenvalue problem
TL;DR: A new method for performing a nonlinear form of principal component analysis by the use of integral operator kernel functions is proposed and experimental results on polynomial feature extraction for pattern recognition are presented.
Shape matching and object recognition using shape contexts
TL;DR: This paper presents work on computing shape models that are computationally fast and invariant basic transformations like translation, scaling and rotation, and proposes shape detection using a feature called shape context, which is descriptive of the shape of the object.
7.3K
Sparse bayesian learning and the relevance vector machine
TL;DR: It is demonstrated that by exploiting a probabilistic Bayesian learning framework, the 'relevance vector machine' (RVM) can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages.
Bayesian interpolation
David J. C. MacKay
- 01 May 1992
TL;DR: The Bayesian approach to regularization and model-comparison is demonstrated by studying the inference problem of interpolating noisy data by examining the posterior probability distribution of regularizing constants and noise levels.
4.7K
Hierarchical mixtures of experts and the EM algorithm
TL;DR: An Expectation-Maximization (EM) algorithm for adjusting the parameters of the tree-structured architecture for supervised learning and an on-line learning algorithm in which the parameters are updated incrementally.
Related Papers (5)
Jack M. Wang,Aaron Hertzmann,David M. Blei +2 more
- 05 Dec 2005
Raquel Urtasun,David J. Fleet,Pascal Fua +2 more
- 17 Jun 2006