Joint multi-omics discriminant analysis with consistent representation learning using PANDA

doi:10.21203/rs.3.rs-4353037/v1


Jia Wu, +18 more

- 17 May 2024

- Research square

TL;DR: PANDA is a joint multi-omics discriminant analysis method that jointly learns consistent discriminant latent representations for each omics, minimizing the differences in distributions among omics and maximizing between-class and minimizing within-class omics variations in a common space.

Abstract: Integrative multi-omics analysis provides deeper insight and enables better and more realistic modeling of the underlying biology and causes of diseases than single-omics analysis does. Although several integrative multi-omics analysis methods have been proposed and have demonstrated promising results in integrating distinct omics datasets, inconsistent distributions across the different omics data, caused by technology variations, pose a challenge for paired integrative multi-omics methods. In addition, the existing discriminant analysis-based integrative methods do not effectively exploit correlation and consistent discriminant structures, necessitating a compromise between correlation and discrimination when using these methods. Herein we present PAN-omics Discriminant Analysis (PANDA), a joint discriminant analysis method that seeks omics-specific discriminant common spaces by jointly learning consistent discriminant latent representations for each omics. PANDA jointly maximizes between-class and minimizes within-class omics variations in a common space and simultaneously models the relationships among omics at the consistency representation and cross-omics correlation levels, removing the need to compromise between discrimination and correlation that existing integrative multi-omics methods face. Because consistency representation learning is incorporated into its objective function, PANDA seeks a common discriminant space that minimizes the differences in distributions among omics, can yield more robust latent representations than other methods, and is robust to inconsistency among the different omics. We compared PANDA with 10 other state-of-the-art multi-omics data integration methods using both simulated and real-world multi-omics datasets and found that PANDA consistently outperformed them while providing meaningful discriminant latent representations.
PANDA is implemented in both R and MATLAB, with code available at https://github.com/WuLabMDA/PANDA.
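
To make the quantities named in the abstract concrete, the following is a minimal Python sketch of a generic Fisher-style discriminant criterion (between-class versus within-class scatter) together with a simple consistency measure between two paired latent representations. It is not the PANDA objective or its released code: the function names `scatter_matrices`, `fisher_ratio`, and `consistency_gap` are hypothetical, and the mean-squared-difference consistency term is an assumption chosen only for illustration.

```python
import numpy as np


def scatter_matrices(X, y):
    """Return (S_b, S_w): between-class and within-class scatter of X.

    X has one sample per row; y holds the class label of each row.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    overall_mean = X.mean(axis=0)
    d = X.shape[1]
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for label in np.unique(y):
        Xc = X[y == label]
        class_mean = Xc.mean(axis=0)
        # Between-class: class-size-weighted outer product of mean offsets.
        diff = (class_mean - overall_mean)[:, None]
        S_b += Xc.shape[0] * (diff @ diff.T)
        # Within-class: scatter of samples around their own class mean.
        centered = Xc - class_mean
        S_w += centered.T @ centered
    return S_b, S_w


def fisher_ratio(Z, y):
    """Trace ratio of between-class to within-class scatter (larger is better)."""
    S_b, S_w = scatter_matrices(Z, y)
    return float(np.trace(S_b) / np.trace(S_w))


def consistency_gap(Z1, Z2):
    """Illustrative assumption: mean squared difference between the latent
    representations of the same samples in two omics (smaller is better)."""
    Z1 = np.asarray(Z1, dtype=float)
    Z2 = np.asarray(Z2, dtype=float)
    return float(np.mean((Z1 - Z2) ** 2))
```

Under these assumptions, a joint method in this spirit would favor latent representations with a high `fisher_ratio` in every omics and a low `consistency_gap` between omics; how PANDA actually combines and weights such terms is defined in the paper and repository, not here.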
