pyComBat, a Python tool for batch effects correction in high-throughput molecular data using empirical Bayes methods
TL;DR: It is shown here that the new Python implementation of ComBat has similar results in terms of batch effects correction; is as fast or faster than the R implementation; and offers new tools for the bioinformatics community to participate in its development.
read more
Abstract: Summary Variability in datasets is not only the product of biological processes: they are also the product of technical biases. ComBat is one of the most widely used tool for correcting those technical biases, called batch effects, in microarray expression data. In this technical note, we present a new Python implementation of ComBat. While the mathematical framework is strictly the same, we show here that our implementation: (i) has similar results in terms of batch effects correction; (ii) is as fast or faster than the R implementation of ComBat and; (iii) offers new tools for the bioinformatics community to participate in its development. Availability and Implementation pyComBat is implemented in the Python language and is available under GPL-3.0 (https://www.gnu.org/licenses/gpl-3.0.en.html) license at https://github.com/epigenelabs/pyComBat and https://pypi.org/project/combat/. Contact akpeli@epigenelabs.com
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Hemin-Induced Death Models Hemorrhagic Stroke and Is a Variant of Classical Neuronal Ferroptosis
05 Jan 2022
TL;DR: In this article , the authors showed that hemin exposure, a model of secondary injury in brain hemorrhage and ferroptosis, activated ERK1/2 in mouse neurons.
30
Single cell dynamics of tumor specificity vs bystander activity in CD8+ T cells define the diverse immune landscapes in colorectal cancer
Daniel Morales Borràs,Sara Verbandt,Markus Ausserhofer,Gregor Sturm,Jinyeong Lim,Gil Arasa Verge,Isaure Vanmeerbeek,Raquel S. Laureano,Jannes Govaerts,Jenny Sprooten,Yourae Hong,Rebecca Wall,Gert de Hertogh,Xavier Sagaert,G Bislenghi,André D'Hoore,Albert Wolthuis,Francesca Finotello,Woong-Yang Park,S. Naulaerts,Sabine Tejpar,Abhishek D. Garg +21 more
TL;DR: A deep single cell mapping of CRC CD8^+ T cells on transcriptomic and T cell receptor (TCR) repertoire levels in a diverse patient cohort revealed that CRC T cell dynamics are underscored by complex interactions between interferon-γ signaling, tumor reactivity, TCR repertoire, and environmental cues like gut microbiome or colon tissue-specific ‘self-like’ features.
30
Perspectives for better batch effect correction in mass-spectrometry-based proteomics
TL;DR: In this paper , the authors explore several relevant issues pertinent to batch effect correct considerations, including applications of batch effect correction requiring prior knowledge on batch factors and exploring data to uncover new/unknown batch factors.
18
OmicVerse: a framework for bridging and deepening insights across bulk and single-cell sequencing
Zehua Zeng,Yuqing Ma,Lei Hu,Bradford A. Tan,Peng Liu,Yixuan Wang,Cencan Xing,Yuanyan Xiong,Hongwu Du +8 more
TL;DR: The single cell trajectory blending from Bulk RNA-seq (BulkTrajBlend) algorithm is introduced, a component of the OmicVerse suite that leverages a Beta-Variational AutoEncoder for data deconvolution and graph neural networks for the discovery of overlapping communities.
16
A Novel Nanosafety Approach Using Cell Painting, Metabolomics, and Lipidomics Captures the Cellular and Molecular Phenotypes Induced by the Unintentionally Formed Metal-Based (Nano)Particles
Andi Alijagic,N. Scherbak,O. V. Kotlyar,Patrik Karlsson,Xuying Wang,I. Odnevall,Oldřich Benada,Ali Amiryousefi,Lena Andersson,Alexander Persson,Jenny Felth,Henrik Andersson,Maria Larsson,Alexander Hedbrant,Samira Salihovic,Tuulia Hyötyläinen,Dirk Repsilber,Eva Särndahl,Magnus Engwall +18 more
TL;DR: In this paper , the effects of the unintentionally emitted AM (nano)particles (AMPs) on human cells following inhalation, require further investigations, including physicochemical characterization of the AMPs, extracted from the filter of a Laser Powder Bed Fusion (L-PBF) 3D printer of iron-based materials, disclosed their complexity, in terms of size, shape, and chemistry.
References
Exploration, normalization, and summaries of high density oligonucleotide array probe level data
Rafael A. Irizarry,Bridget G. Hobbs,Francois Collin,Yasmin Beazer-Barclay,Kristen J. Antonellis,Uwe Scherf,Terence P. Speed +6 more
TL;DR: There is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities, and the exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values.
The NumPy Array: A Structure for Efficient Numerical Computation
TL;DR: In this article, the authors show how to improve the performance of NumPy arrays through vectorizing calculations, avoiding copying data in memory, and minimizing operation counts, which is a technique similar to the one described in this paper.
Data Structures for Statistical Computing in Python
Wes McKinney
- 01 Jan 2010
TL;DR: P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.
Adjusting batch effects in microarray expression data using empirical Bayes methods
TL;DR: This paper proposed parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that is robust to outliers in small sample sizes and performs comparable to existing methods for large samples.
Comprehensive molecular characterization of human colon and rectal cancer
Donna M. Muzny,Matthew N. Bainbridge,Kyle Chang,Huyen Dinh,Jennifer Drummond,Gerald R. Fowler,Christie Kovar,Lora Lewis,Margaret Morgan,Irene Newsham,Jeffrey G. Reid,Jireh Santibanez,Eve Shinbrot,Lisa R. Trevino,Yuan Qing Wu,Min Wang,Preethi H. Gunaratne,Preethi H. Gunaratne,Lawrence A. Donehower,Chad J. Creighton,David A. Wheeler,Richard A. Gibbs,Michael S. Lawrence,Douglas Voet,Rui Jing,Kristian Cibulskis,Andrey Sivachenko,Andrey Sivachenko,Petar Stojanov,Aaron McKenna,Eric S. Lander,Eric S. Lander,Stacey Gabriel,Li Ding,Robert S. Fulton,Daniel C. Koboldt,Todd Wylie,Jason Walker,David J. Dooling,Lucinda Fulton,Kim D. Delehaunty,Catrina Fronick,Ryan Demeter,Elaine R. Mardis,Richard K. Wilson,Andy Chu,Hye Jung E. Chun,Andrew J. Mungall,Erin Pleasance,A. Gordon Robertson,Dominik Stoll,Miruna Balasundaram,Inanc Birol,Yaron S.N. Butterfield,Eric Chuah,Robin J.N. Coope,Noreen Dhalla,Ranabir Guin,Carrie Hirst,Martin Hirst,Robert A. Holt,Darlene Lee,Haiyan I. Li,Michael Mayo,Richard A. Moore,Jacqueline E. Schein,Jared R. Slobodan,Angela Tam,Nina Thiessen,Richard Varhol,Thomas Zeng,Yongjun Zhao,Steven J.M. Jones,Marco A. Marra,Adam J. Bass,Alex H. Ramos,Gordon Saksena,Andrew D. Cherniack,Stephen E. Schumacher,Barbara Tabak,Scott L. Carter,Nam Pho,Huy V. Nguyen,Robert C. Onofrio,Andrew Crenshaw,Kristin G. Ardlie,Rameen Beroukhim,Wendy Winckler,Matthew Meyerson,Alexei Protopopov,Angela Hadjipanayis,Eunjung Lee,Ruibin Xi,Lixing Yang,Xiaojia Ren,Narayanan Sathiamoorthy,Peng Chieh Chen,Psalm Haseley,Yonghong Xiao,Semin Lee,Jonathan G. Seidman,Lynda Chin,Peter J. Park,Raju Kucherlapati,J. Todd Auman,Katherine A. Hoadley,Ying Du,Matthew D. Wilkerson,Yan Shi,Christina Liquori,Shaowu Meng,Ling Li,Yidi J. Turman,Michael D. Topal,Donghui Tan,Scot Waring,Elizabeth Buda,Jesse Walsh,Corbin D. Jones,Piotr A. Mieczkowski,Darshan Singh,Junyuan Wu,Anisha Gulabani,Peter Dolina,Tom Bodenheimer,Alan P. Hoyle,Janae V. Simons,Matthew G. Soloway,Lisle E. Mose,Stuart R. Jefferys,Saianand Balu,Brian O'Connor,Jan F. Prins,Derek Y. Chiang,D. Neil Hayes,Charles M. Perou,Toshinori Hinoue,Daniel J. Weisenberger,Dennis T. Maglinte,Fei Pan,Benjamin P. Berman,David Van Den Berg,Hui Shen,Timothy J. Triche,Stephen B. Baylin,Peter W. Laird,Gad Getz,Michael S. Noble,Doug Voat,Nils Gehlenborg,Daniel DiCara,Juinhua Zhang,Hailei Zhang,Chang-Jiun Wu,Spring Yingchun Liu,Sachet A. Shukla,Lihua Zhou,Pei Lin,Richard W. Park,Marc Danie Nazaire,James A. Robinson,Helga Thorvaldsdottir,Jill P. Mesirov,Vesteinn Thorsson,Sheila Reynolds,Brady Bernard,Richard Kreisberg,Jake Lin,Lisa Iype,Ryan Bressler,Timo Erkkilä,Madhumati Gundapuneni,Yuexin Liu,Adam Norberg,Thomas Robinson,Da Yang,Wei Zhang,Ilya Shmulevich,Jorma J. de Ronde,Jorma J. de Ronde,Nikolaus Schultz,Ethan Cerami,Giovanni Ciriello,Arthur P. Goldberg,Benjamin Gross,Anders Jacobsen,Jianjiong Gao,Bogumil Kaczkowski,Rileen Sinha,B. Arman Aksoy,Yevgeniy Antipin,Boris Reva,Ronglai Shen,Barry S. Taylor,Marc Ladanyi,Chris Sander,Rehan Akbani,Nianxiang Zhang,Bradley M. Broom,Tod D. Casasent,Anna K. Unruh,Chris Wakefield,Stanley R. Hamilton,R. Craig Cason,Keith A. Baggerly,John N. Weinstein,David Haussler,Christopher C. Benz,Joshua M. Stuart,Stephen C. Benz,J. Zachary Sanborn,Charles J. Vaske,Jingchun Zhu,Christopher Szeto,Gary K. Scott,Christina Yau,Sam Ng,Theodore C. Goldstein,Kyle Ellrott,Eric A. Collisson,Aaron E. Cozen,Daniel R. Zerbino,Christopher Wilks,Brian Craft,Paul T. Spellman,Robert Penny,Troy Shelton,Martha Hatfield,Scott Morris,Peggy Yena,Candace Shelton,Mark Sherman,Joseph Paulauskis,Julie M. Gastier-Foster,Julie M. Gastier-Foster,Jay Bowen,Nilsa C. Ramirez,Nilsa C. Ramirez,Aaron D. Black,Robert E. Pyatt,Robert E. Pyatt,Lisa Wise,Peter White,Peter White,Monica M. Bertagnolli,Jen Brown,Timothy A. Chan,Gerald C. Chu,Christine Czerwinski,Fred Denstman,Rajiv Dhir,Arnulf Dörner,Charles S. Fuchs,Jose G. Guillem,Mary Iacocca,Hartmut Juhl,Andrew Kaufman,Bernard Kohl,Xuan Van Le,Maria C. Mariano,Elizabeth N. Medina,Michael Meyers,Garrett M. Nash,P. Paty,Nicholas J. Petrelli,Brenda Rabeno,William G. Richards,David B. Solit,Pat Swanson,Larissa K. Temple,Joel E. Tepper,Richard Thorp,Efsevia Vakiani,Martin R. Weiser,Joseph Willis,Gary Witkin,Zhaoshi Zeng,Michael J. Zinner,Carsten Zornig,Mark A. Jensen,Robert Sfeir,Ari B. Kahn,Anna L. Chu,Prachi Kothiyal,Zhining Wang,Eric E. Snyder,Joan Pontius,Todd Pihl,Brenda Ayala,Mark Backus,Jessica Walton,Jon Whitmore,Julien Baboud,Dominique L. Berton,Matthew C. Nicholls,Deepak Srinivasan,Rohini Raman,Stanley Girshik,Peter A. Kigonya,Shelley Alonso,Rashmi N. Sanbhadti,Sean P. Barletta,John M. Greene,David Pot,Kenna R. Mills Shaw,Laura A.L. Dillon,Kenneth H. Buetow,Tanja Davidsen,John A. Demchok,Greg Eley,Martin L. Ferguson,Peter Fielding,Carl F. Schaefer,Margi Sheth,Liming Yang,Mark S. Guyer,Bradley A. Ozenberger,Jacqueline D. Palchik,Jane Peterson,Heidi J. Sofia,Elizabeth J. Thomson +320 more
TL;DR: Integrative analyses suggest new markers for aggressive colorectal carcinoma and an important role for MYC-directed transcriptional activation and repression.

