Compositional data

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.2307/2982045•

The Statistical Analysis of Compositional Data

[...]

M. C. Jones¹, John Aitchison•Institutions (1)

University of Bath¹

1 Jul 1987

4,148 citations

Journal Article•10.3389/FMICB.2017.02224•

Microbiome Datasets Are Compositional: And This Is Not Optional.

[...]

Gregory B. Gloor¹, Jean M. Macklaim¹, Vera Pawlowsky-Glahn, Juan José Egozcue²•Institutions (2)

University of Western Ontario¹, Polytechnic University of Catalonia²

15 Nov 2017-Frontiers in Microbiology

TL;DR: The purpose of this review is to alert investigators to the dangers inherent in ignoring the compositional nature of the data, and point out that HTS datasets derived from microbiome studies can and should be treated as compositions at all stages of analysis.

...read moreread less

Abstract: Datasets collected by high-throughput sequencing (HTS) of 16S rRNA gene amplimers, metagenomes or metatranscriptomes are commonplace and being used to study human disease states, ecological differences between sites, and the built environment. There is increasing awareness that microbiome datasets generated by HTS are compositional because they have an arbitrary total imposed by the instrument. However, many investigators are either unaware of this or assume specific properties of the compositional data. The purpose of this review is to alert investigators to the dangers inherent in ignoring the compositional nature of the data, and point out that HTS datasets derived from microbiome studies can and should be treated as compositions at all stages of analysis. We briefly introduce compositional data, illustrate the pathologies that occur when compositional data are analyzed inappropriately, and finally give guidance and point to resources and examples for the analysis of microbiome datasets using compositional data analysis.

...read moreread less

2,413 citations

Dissertation•10.5353/TH_B3123037•

The statistical analysis of compositional data

[...]

Shir-ming. Shen, 沈雪明

1 Jan 1983

1,017 citations

Book•10.1002/9781119976462•

Compositional data analysis : theory and applications

[...]

Vera Pawlowsky-Glahn, Antonella Buccianti

23 Sep 2011

TL;DR: This book presents the history and development of compositional data analysis along with Aitchison's log-ratio approach, and describes the state of the art both in theoretical fields as well as applications in the different fields of science.

...read moreread less

Abstract: It is difficult to imagine that the statistical analysis of compositional data has been a major issue of concern for more than 100 years. It is even more difficult to realize that so many statisticians and users of statistics are unaware of the particular problems affecting compositional data, as well as their solutions. The issue of ``spurious correlation'', as the situation was phrased by Karl Pearson back in 1897, affects all data that measures parts of some whole, such as percentages, proportions, ppm and ppb. Such measurements are present in all fields of science, ranging from geology, biology, environmental sciences, forensic sciences, medicine and hydrology. This book presents the history and development of compositional data analysis along with Aitchison's log-ratio approach. Compositional Data Analysis describes the state of the art both in theoretical fields as well as applications in the different fields of science.

...read moreread less

750 citations

Journal Article•10.1111/1467-9876.00275•

Biplots of compositional data

[...]

John Aitchison¹, Michael Greenacre²•Institutions (2)

University of Glasgow¹, Pompeu Fabra University²

01 Oct 2002-Journal of The Royal Statistical Society Series C-applied Statistics

TL;DR: The singular value decomposition and its interpretation as a linear biplot have proved to be a powerful tool for analysing many forms of multivariate data as discussed by the authors, including compositional data consisting of positive vectors.

...read moreread less

Abstract: Summary. The singular value decomposition and its interpretation as a linear biplot have proved to be a powerful tool for analysing many forms of multivariate data. Here we adapt biplot methodology to the specific case of compositional data consisting of positive vectors each of which is constrained to have unit sum. These relative variation biplots have properties relating to the special features of compositional data: the study of ratios, subcompositions and models of compositional relationships. The methodology is applied to a data set consisting of six-part colour compositions in 22 abstract paintings, showing how the singular value decomposition can achieve an accurate biplot of the colour ratios and how possible models interrelating the colours can be diagnosed.

...read moreread less

702 citations

...

Expand

Year	Papers
2025	14
2024	30
2023	44
2022	79
2021	71
2020	39

Topic Tools

Papers published on a yearly basis

Papers

The Statistical Analysis of Compositional Data

Microbiome Datasets Are Compositional: And This Is Not Optional.

The statistical analysis of compositional data

Compositional data analysis : theory and applications

Biplots of compositional data

Related Topics (5)

Performance Metrics