Journal Article10.21203/rs.3.rs-4353037/v1
Joint multi-omics discriminant analysis with consistent representation learning using PANDA
Jia Wu,M Mohammad Aminu,Lingzhi Hong,Natalie Vokes,Stephanie Schmidt,Maliazurina Saad,Bo Zhu,Tina Cascone,Ajay Sheshadri,David Jaffray,P. Andrew Futreal,Jack Lee,Lauren Byers,Don Gibbons,John Heymach,Ken Chen,Chao Cheng,Jianjun Zhang,Bo Wang +18 more
1
TL;DR: PANDA is a joint multi-omics discriminant analysis method that jointly learns consistent discriminant latent representations for each omics, minimizing the differences in distributions among omics and maximizing between-class and minimizing within-class omics variations in a common space.
read more
Abstract: Abstract Integrative multi-omics analysis provides deeper insight and enables better and more realistic modeling of the underlying biology and causes of diseases than does single omics analysis. Although several integrative multi-omics analysis methods have been proposed and demonstrated promising results in integrating distinct omics datasets, inconsistent distribution of the different omics data, which is caused by technology variations, poses a challenge for paired integrative multi-omics methods. In addition, the existing discriminant analysis–based integrative methods do not effectively exploit correlation and consistent discriminant structures, necessitating a compromise between correlation and discrimination in using these methods. Herein we present PAN-omics Discriminant Analysis (PANDA), a joint discriminant analysis method that seeks omics-specific discriminant common spaces by jointly learning consistent discriminant latent representations for each omics. PANDA jointly maximizes between-class and minimizes within-class omics variations in a common space and simultaneously models the relationships among omics at the consistency representation and cross-omics correlation levels, overcoming the need for compromise between discrimination and correlation as with the existing integrative multi-omics methods. Because of the consistency representation learning incorporated into the objective function of PANDA, this method seeks a common discriminant space to minimize the differences in distributions among omics, can lead to a more robust latent representations than other methods, and is against the inconsistency of the different omics. We compared PANDA to 10 other state-of-the-art multi-omics data integration methods using both simulated and real-world multi-omics datasets and found that PANDA consistently outperformed them while providing meaningful discriminant latent representations. PANDA is implemented using both R and MATLAB, with codes available at https://github.com/WuLabMDA/PANDA.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Gut-ovary axis in polycystic ovary syndrome: mechanistic insights and gut microbiota-targeted therapeutic strategies
Mei Zhao,Mei Zhao,Danlin Chen,Xiumei Hu,Caiping Xie,Lian-wei Xu,Fuhua Zhou +6 more
Abstract: Polycystic ovary syndrome (PCOS) is a common endocrine and metabolic disorder that significantly affects women’s reproductive health and quality of life. Its pathogenesis involves multiple factors, including genetics, environment, and metabolism. In recent years, with the growing body of research on PCOS, the “gut-ovary axis” hypothesis has become a prominent research focus. This hypothesis suggests that an imbalance in gut bacteria may significantly influence the onset and progression of PCOS through various pathways, such as immune regulation, metabolic disturbances, and hormonal imbalances. This article aims to review the role of the “gut-ovary axis” in PCOS and to explore novel treatment strategies based on gut microbiota modulation, including probiotics, fecal microbiota transplantation, and dietary interventions. These strategies represent promising research avenues for future PCOS treatments, with preliminary studies demonstrating their potential to improve clinical symptoms. However, it is crucial to note that these are not yet established therapies and require substantial further validation. Novelty and Significance of this Review: This review moves beyond a descriptive catalog of associations to provide a critical appraisal of the gut-ovary axis in PCOS. We systematically differentiate well-established mechanisms from speculative hypotheses, explicitly identify persistent knowledge gaps, and evaluate the translational potential of microbiota-targeted therapies, thereby offering a refined framework for future basic and clinical research.
References
Controlling the false discovery rate: a practical and powerful approach to multiple testing
Yoav Benjamini,Yosef Hochberg +1 more
TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses -the false discovery rate, which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.
Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks
Paul Shannon,Andrew Markiel,Owen Ozier,Nitin S. Baliga,Jonathan T. Wang,Daniel Ramage,Nada Amin,Benno Schwikowski,Trey Ideker +8 more
TL;DR: Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.
Learning the parts of objects by non-negative matrix factorization
TL;DR: An algorithm for non-negative matrix factorization is demonstrated that is able to learn parts of faces and semantic features of text and is in contrast to other methods that learn holistic, not parts-based, representations.
14.2K
Integrated analysis of multimodal single-cell data
Yuhan Hao,Stephanie Hao,Erica Andersen-Nissen,William M. Mauck,Shiwei Zheng,Andrew Butler,Maddie Jane Lee,Aaron J. Wilk,Charlotte A. Darby,Michael Zager,Paul Hoffman,Marlon Stoeckius,Efthymia Papalexi,Eleni P. Mimitou,Jaison Jain,Avi Srivastava,Tim Stuart,Lamar M. Fleming,Bertrand Z. Yeung,Angela J. Rogers,Juliana M. McElrath,Catherine A. Blish,Raphael Gottardo,Peter Smibert,Rahul Satija +24 more
TL;DR: Weighted-nearest neighbor analysis as mentioned in this paper is an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities.
9.8K
Independent component analysis: algorithms and applications
Aapo Hyvärinen,Erkki Oja +1 more
TL;DR: The basic theory and applications of ICA are presented, and the goal is to find a linear representation of non-Gaussian data so that the components are statistically independent, or as independent as possible.
9.7K