Sparse PCA

Topic Tools

Papers published on a yearly basis

Papers

Reference Entry•10.1002/0470013192.BSA501•

Principal Component Analysis

[...]

Ian T. Jolliffe¹•Institutions (1)

University of Aberdeen¹

15 Oct 2005

TL;DR: Principal component analysis (PCA) as discussed by the authors replaces the p original variables by a smaller number, q, of derived variables, the principal components, which are linear combinations of the original variables.

...read moreread less

Abstract: When large multivariate datasets are analyzed, it is often desirable to reduce their dimensionality. Principal component analysis is one technique for doing this. It replaces the p original variables by a smaller number, q, of derived variables, the principal components, which are linear combinations of the original variables. Often, it is possible to retain most of the variability in the original variables with q very much smaller than p. Despite its apparent simplicity, principal component analysis has a number of subtleties, and it has many uses and extensions. A number of choices associated with the technique are briefly discussed, namely, covariance or correlation, how many components, and different normalization constraints, as well as confusion with factor analysis. Various uses and extensions are outlined. Keywords: dimension reduction; factor analysis; multivariate analysis; variance maximization

...read moreread less

15,111 citations

Journal Article•10.1002/WICS.101•

Principal component analysis

[...]

Hervé Abdi¹, Lynne J. Williams²•Institutions (2)

University of Texas at Dallas¹, University of Toronto²

01 Jul 2010-Wiley Interdisciplinary Reviews: Computational Statistics

TL;DR: Principal component analysis (PCA) as discussed by the authors is a multivariate technique that analyzes a data table in which observations are described by several inter-correlated quantitative dependent variables, and its goal is to extract the important information from the table, to represent it as a set of new orthogonal variables called principal components, and display the pattern of similarity of the observations and of the variables as points in maps.

...read moreread less

Abstract: Principal component analysis PCA is a multivariate technique that analyzes a data table in which observations are described by several inter-correlated quantitative dependent variables. Its goal is to extract the important information from the table, to represent it as a set of new orthogonal variables called principal components, and to display the pattern of similarity of the observations and of the variables as points in maps. The quality of the PCA model can be evaluated using cross-validation techniques such as the bootstrap and the jackknife. PCA can be generalized as correspondence analysis CA in order to handle qualitative variables and as multiple factor analysis MFA in order to handle heterogeneous sets of variables. Mathematically, PCA depends upon the eigen-decomposition of positive semi-definite matrices and upon the singular value decomposition SVD of rectangular matrices. Copyright © 2010 John Wiley & Sons, Inc.

...read moreread less

8,695 citations

Journal Article•10.1145/1970392.1970395•

Robust principal component analysis

[...]

Emmanuel J. Candès¹, Xiaodong Li¹, Yi Ma², John Wright³•Institutions (3)

Stanford University¹, University of Illinois at Urbana–Champaign², Microsoft³

09 Jun 2011-Journal of the ACM

TL;DR: In this paper, the authors prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the e1 norm.

...read moreread less

Abstract: This article is about a curious phenomenon. Suppose we have a data matrix, which is the superposition of a low-rank component and a sparse component. Can we recover each component individuallyq We prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the e1 norm. This suggests the possibility of a principled approach to robust principal component analysis since our methodology and results assert that one can recover the principal components of a data matrix even though a positive fraction of its entries are arbitrarily corrupted. This extends to the situation where a fraction of the entries are missing as well. We discuss an algorithm for solving this optimization problem, and present applications in the area of video surveillance, where our methodology allows for the detection of objects in a cluttered background, and in the area of face recognition, where it offers a principled way of removing shadows and specularities in images of faces.

...read moreread less

8,174 citations

Journal Article•10.1098/RSTA.2015.0202•

Principal component analysis: a review and recent developments

[...]

Ian T. Jolliffe¹, Jorge Cadima², Jorge Cadima³•Institutions (3)

University of Exeter¹, University of Lisbon², Instituto Superior de Agronomia³

13 Apr 2016-Philosophical Transactions of the Royal Society A

TL;DR: The basic ideas of PCA are introduced, discussing what it can and cannot do, and some variants of the technique have been developed that are tailored to various different data types and structures.

...read moreread less

Abstract: Large datasets are increasingly common and are often difficult to interpret. Principal component analysis (PCA) is a technique for reducing the dimensionality of such datasets, increasing interpretability but at the same time minimizing information loss. It does so by creating new uncorrelated variables that successively maximize variance. Finding such new variables, the principal components, reduces to solving an eigenvalue/eigenvector problem, and the new variables are defined by the dataset at hand, not a priori , hence making PCA an adaptive data analysis technique. It is adaptive in another sense too, since variants of the technique have been developed that are tailored to various different data types and structures. This article will begin by introducing the basic ideas of PCA, discussing what it can and cannot do. It will then describe some variants of PCA and their application.

...read moreread less

7,488 citations

Journal Article•10.1111/1467-9868.00196•

Probabilistic Principal Component Analysis

[...]

Michael E. Tipping¹, Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 1999-Journal of The Royal Statistical Society Series B-statistical Methodology

TL;DR: In this paper, the principal axes of a set of observed data vectors may be determined through maximum-likelihood estimation of parameters in a latent variable model closely related to factor analysis.

...read moreread less

Abstract: Principal component analysis (PCA) is a ubiquitous technique for data analysis and processing, but one which is not based upon a probability model. In this paper we demonstrate how the principal axes of a set of observed data vectors may be determined through maximum-likelihood estimation of parameters in a latent variable model closely related to factor analysis. We consider the properties of the associated likelihood function, giving an EM algorithm for estimating the principal subspace iteratively, and discuss the advantages conveyed by the definition of a probability density function for PCA.

...read moreread less

4,057 citations

...

Expand

Year	Papers
2025	1
2024	1
2023	23
2022	53
2021	35
2020	47

Topic Tools

Papers published on a yearly basis

Papers

Principal Component Analysis

Principal component analysis

Robust principal component analysis

Principal component analysis: a review and recent developments

Probabilistic Principal Component Analysis

Related Topics (5)

Performance Metrics