Fast, sensitive and accurate integration of single-cell data with Harmony.
Ilya Korsunsky,Nghia Millard,Jean Fan,Kamil Slowikowski,Fan Zhang,Kevin Wei,Yuriy Baglaenko,Michael B. Brenner,Po-Ru Loh,Po-Ru Loh,Po-Ru Loh,Soumya Raychaudhuri +11 more
TL;DR: Harmony, for the integration of single-cell transcriptomic data, identifies broad and fine-grained populations, scales to large datasets, and can integrate sequencing- and imaging-based data.
read more
Abstract: The emerging diversity of single-cell RNA-seq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies, because biological and technical differences are interspersed. We present Harmony (
https://github.com/immunogenomics/harmony
), an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms while requiring fewer computational resources. Harmony enables the integration of ~106 cells on a personal computer. We apply Harmony to peripheral blood mononuclear cells from datasets with large experimental differences, five studies of pancreatic islet cells, mouse embryogenesis datasets and the integration of scRNA-seq with spatial transcriptomics data. Harmony, for the integration of single-cell transcriptomic data, identifies broad and fine-grained populations, scales to large datasets, and can integrate sequencing- and imaging-based data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Developmental landscape of human forebrain at a single-cell level identifies early waves of oligodendrogenesis.
David van Bruggen,Fabio Pohl,Christoffer Mattsson Langseth,Petra Kukanja,Hower Lee,Alejandro Mossi Albiach,Mukund Kabbe,Mandy Meijer,Sten Linnarsson,Markus M. Hilscher,Mats Nilsson,Erik Sundström,Gonçalo Castelo-Branco +12 more
TL;DR: Using single-cell RNA sequencing, the authors found evidence of the emergence of a first wave of oligodendrocyte lineage cells as early as post-conception weeks (PCW) 8-10.
48
A human model of asthma exacerbation reveals transcriptional programs and cell circuits specific to allergic asthma
Jehan Alladina,Neal Smith,Tristan Kooistra,Kamil Slowikowski,Isabela Kernin,Jacques Deguine,Harry Keen,Kasidet Manakongtreecheep,Jessica Tantivit,Rod A. Rahimi,Alexis Haring,F. Giacona,Lida P. Hariri,Ramnik J. Xavier,Andrew D. Luster,Alexandra-Chloé Villani,Josalyn L. Cho,Benjamin D. Medoff +17 more
TL;DR: In this article , the authors compared the lower airway mucosa in allergic asthmatics and allergic non-asthmatic controls using single-cell RNA sequencing and found that the airway epithelium was highly dynamic and upregulated genes involved in matrix degradation, mucus metaplasia, and glycolysis while failing to induce injury-repair and antioxidant pathways observed in controls.
48
Resolving SARS-CoV-2 CD4+ T cell specificity via reverse epitope discovery
Mikhail V. Pogorelyy,Elisa Rosati,Anastasia A. Minervina,Robert C. Mettelman,Alexander Scheffold,Andre Franke,Petra Bacher,Paul G. Thomas +7 more
TL;DR: In this article , the authors performed a meta-analysis of large, publicly available single-cell and bulk TCR datasets from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-infected individuals to identify public CD4+ responses.
48
Control of neurogenic competence in mammalian hypothalamic tanycytes.
Sooyeon Yoo,Sooyeon Yoo,Juhyun Kim,Pin Lyu,Thanh Hoang,Alex Ma,Vickie Trinh,Weina Dai,Lizhi Jiang,Patrick Leavey,Leighton Duncan,Jae Kyung Won,Sung Hye Park,Jiang Qian,Solange P. Brown,Solange P. Brown,Seth Blackshaw +16 more
TL;DR: In this paper, the authors show that tanycyte-specific disruption of the NFI family of transcription factors (Nfia/b/x) robustly stimulates the proliferation and neurogenesis in the postnatal hypothalamus.
48
Gut microbiome is linked to functions of peripheral immune cells in transition cows during excessive lipolysis
Fengfei Gu,Senlin Zhu,Yi Tang,Xiaohan Liu,Minghui Jia,Nilusha Malmuthuge,Teresa G. Valencak,Joseph W. McFadden,Jianxin Liu,Hui-Zeng Sun +9 more
TL;DR: In this article , the authors investigated the potential links between the gut microbiome and postpartum immunosuppression in periparturient dairy cows with excessive lipolysis using single immune cell transcriptome, 16S amplicon sequencing, metagenomics, and targeted metabolomics.
References
STAR: ultrafast universal RNA-seq aligner
Alexander Dobin,Carrie A. Davis,Felix Schlesinger,Jorg Drenkow,Chris Zaleski,Sonali Jha,Philippe Batut,Mark Chaisson,Thomas R. Gingeras +8 more
TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure outperforms other aligners by a factor of >50 in mapping speed.
Gene Ontology: tool for the unification of biology
M Ashburner,Catherine A. Ball,Judith A. Blake,David Botstein,Heather Butler,J. M. Cherry,Allan Peter Davis,Kara Dolinski,Selina S. Dwight,J.T. Eppig,Midori A. Harris,David P. Hill,Laurie Issel-Tarver,Andrew Kasarskis,Suzanna E. Lewis,John C. Matese,Joel E. Richardson,M. Ringwald,Gerald M. Rubin,Gavin Sherlock +19 more
TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
limma powers differential expression analyses for RNA-sequencing and microarray studies
Matthew E. Ritchie,Belinda Phipson,Di Wu,Yifang Hu,Charity W. Law,Wei Shi,Gordon K. Smyth,Gordon K. Smyth +7 more
TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Fast unfolding of communities in large networks
Vincent D. Blondel,Jean-Loup Guillaume,Jean-Loup Guillaume,Renaud Lambiotte,Renaud Lambiotte,Etienne Lefebvre +5 more
TL;DR: This work proposes a heuristic method that is shown to outperform all other known community detection methods in terms of computation time and the quality of the communities detected is very good, as measured by the so-called modularity.
Integrating single-cell transcriptomic data across different conditions, technologies, and species.
TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.
Related Papers (5)
Grace X.Y. Zheng,Jessica M. Terry,Phillip Belgrader,Paul Ryvkin,Zachary Bent,Ryan Wilson,Solongo B. Ziraldo,Tobias Daniel Wheeler,Geoffrey P. McDermott,Junjie Zhu,Mark T. Gregory,Joe Shuga,Luz Montesclaros,Jason G. Underwood,Donald A. Masquelier,Stefanie Y. Nishimura,Michael Schnall-Levin,Paul Wyatt,Christopher Hindson,Rajiv Bharadwaj,Alexander Wong,Kevin D. Ness,Lan Beppu,H. Joachim Deeg,Christopher McFarland,Keith R. Loeb,Keith R. Loeb,William J. Valente,William J. Valente,Nolan G. Ericson,Emily A. Stevens,Jerald P. Radich,Tarjei S. Mikkelsen,Benjamin J. Hindson,Jason H. Bielas +34 more
Evan Z. Macosko,Evan Z. Macosko,Anindita Basu,Anindita Basu,Rahul Satija,Rahul Satija,James Nemesh,James Nemesh,Karthik Shekhar,Melissa Goldman,Melissa Goldman,Itay Tirosh,Allison R. Bialas,Nolan Kamitaki,Nolan Kamitaki,Emily M. Martersteck,John J. Trombetta,David A. Weitz,Joshua R. Sanes,Alex K. Shalek,Alex K. Shalek,Alex K. Shalek,Aviv Regev,Aviv Regev,Aviv Regev,Steven A. McCarroll,Steven A. McCarroll +26 more