LANTSA: Landmark-based transferable subspace analysis for single-cell and spatial transcriptomics
1
TL;DR: This work demonstrated the superiority of LANTSA to identify accurate data structures via clustering evaluation on benchmark datasets of various scRNA-seq protocols, 10x Visium, and Slide-seq ST platforms and confirmed the integration capability of LantSA to transfer cell annotation on large-scale and cross-platform sc RNA-seq datasets.
read more
Abstract: Single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) technologies provide new insights to understand tissue organization and biological function. Accurately capturing the relationships of samples (e.g., sequenced cells, spatial locations) will result in reliable and consistent outcomes in downstream analyses. However, this undertaking remains a challenge for large-volume or cross-platform datasets due to transcriptional heterogeneity and high computational demands. Here, we introduce landmark-based transferable subspace analysis (LANTSA) to solve such challenges for scRNA-seq and ST datasets. Specifically, LANTSA constructs a representation graph of samples for clustering and visualization based on a novel subspace model, which can learn a more accurate representation and is theoretically proven to be linearly proportional to data size in terms of the time consumption. Furthermore, LANTSA uses a dimensionality reduction technique as an integrative method to extract the discriminants underlying the representation structure, which enables label transfer from one (learning) dataset (i.e., scRNA-seq profiles) to the other (prediction) datasets (e.g., scRNA-seq or ST profiles), thus solving the massive-volume or cross-platform problem. We demonstrated the superiority of LANTSA to identify accurate data structures via clustering evaluation on benchmark datasets of various scRNA-seq protocols, 10x Visium, and Slide-seq ST platforms. Moreover, we confirmed the integration capability of LANTSA to transfer cell annotation on large-scale and cross-platform scRNA-seq datasets. Finally, we validated the effectiveness of LANTSA for the identification of multiple mouse brain areas as well as the spatial mapping of cell types within cortical layers by integrating scRNA-seq and ST data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Spatial domains identification in spatial transcriptomics by domain knowledge-aware and subspace-enhanced graph contrastive learning
TL;DR: GRAS4T is a novel graph contrastive learning framework for spatial domain identification in spatial transcriptomics, leveraging domain knowledge and subspace-enhanced graph contrastive learning to accurately distinguish different spatial domains.
References
Comprehensive Integration of Single-Cell Data.
Tim Stuart,Andrew Butler,Paul J. Hoffman,Christoph Hafemeister,Efthymia Papalexi,William M. Mauck,Yuhan Hao,Marlon Stoeckius,Peter Smibert,Rahul Satija +9 more
TL;DR: A strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities.
13.4K
Integrating single-cell transcriptomic data across different conditions, technologies, and species.
TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets.
Yingyao Zhou,Bin Zhou,Lars Pache,Max W. Chang,Alireza Hadj Khodabakhshi,Olga Tanaseichuk,Christopher Benner,Sumit K. Chanda +7 more
TL;DR: A biologist-oriented portal that provides a gene list annotation, enrichment and interactome resource and enables integrated analysis of multi-OMICs datasets, Metascape is an effective and efficient tool for experimental biologists to comprehensively analyze and interpret OMICs-based studies in the big data era.
Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets
Evan Z. Macosko,Evan Z. Macosko,Anindita Basu,Anindita Basu,Rahul Satija,Rahul Satija,James Nemesh,James Nemesh,Karthik Shekhar,Melissa Goldman,Melissa Goldman,Itay Tirosh,Allison R. Bialas,Nolan Kamitaki,Nolan Kamitaki,Emily M. Martersteck,John J. Trombetta,David A. Weitz,Joshua R. Sanes,Alex K. Shalek,Alex K. Shalek,Alex K. Shalek,Aviv Regev,Aviv Regev,Aviv Regev,Steven A. McCarroll,Steven A. McCarroll +26 more
TL;DR: Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together.
7.3K
Massively parallel digital transcriptional profiling of single cells
Grace X.Y. Zheng,Jessica M. Terry,Phillip Belgrader,Paul Ryvkin,Zachary Bent,Ryan Wilson,Solongo B. Ziraldo,Tobias Daniel Wheeler,Geoffrey P. McDermott,Junjie Zhu,Mark T. Gregory,Joe Shuga,Luz Montesclaros,Jason G. Underwood,Donald A. Masquelier,Stefanie Y. Nishimura,Michael Schnall-Levin,Paul Wyatt,Christopher Hindson,Rajiv Bharadwaj,Alexander Wong,Kevin D. Ness,Lan Beppu,H. Joachim Deeg,Christopher McFarland,Keith R. Loeb,Keith R. Loeb,William J. Valente,William J. Valente,Nolan G. Ericson,Emily A. Stevens,Jerald P. Radich,Tarjei S. Mikkelsen,Benjamin J. Hindson,Jason H. Bielas +34 more
TL;DR: A droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample is described and sequence variation in the transcriptome data is used to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients.
Related Papers (5)
Gang Zhang,Jiansheng Chen +1 more
- 21 Sep 2016
Lih-Heng Chan,Sh-Hussain Salleh,Chee-Ming Ting +2 more
- 25 May 2009