Journal Article10.1101/2024.08.04.606512
Identifying Differential Spatial Expression Patterns across Different Slices, Conditions and Developmental Stages with Interpretable Deep Learning
TL;DR: Researchers propose River, an interpretable deep learning method, to identify differential spatial expression patterns across multiple tissue slices, conditions, and developmental stages, showcasing its scalability and versatility in analyzing complex tissue architectures.
read more
Abstract: Spatially resolved transcriptomics technologies enable the mapping of multiplexed gene expression profiles within tissue contexts. To explore the gene spatial patterns in complex tissues, computational methods have been developed to identify spatially variable genes within single tissue slices. However, there is a lack of methods designed to identify genes with differential spatial expression patterns (DSEPs) across multiple slices or conditions, which becomes increasingly common in complex experimental designs. The challenges include the complexity of cross-slice gene expression and spatial information modeling, scalability issues in constructing large-scale cell graphs, and mixed factors of inter-slice heterogeneity. We propose DSEP gene identification as a new task and develop River, an interpretable deep learning-based method, to solve this task. River comprises a two-branch prediction model architecture and a post-hoc attribution method to prioritize DSEP genes that explain condition differences. River’s special design for modeling spatial-informed gene expression makes it scalable to large-scale spatial omics datasets. We proposed strategies to decouple the spatial and non-spatial components of River’s outcomes. We validated River’s performance using simulated datasets and applied it to identify DSEP genes/proteins in diverse biological contexts, including embryo development, diabetes-induced alterations in spermatogenesis, and lupus-induced splenic changes. In a human triple-negative breast cancer dataset, River identified generalizable survival-related DSEPs, validated across unseen patient groups. River does not rely on specific data distribution assumptions and is compatible with various spatial omics data types, making it a versatile method for analyzing complex tissue architectures across multiple biological conditions.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
References
•Journal Article
Visualizing Data using t-SNE
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
•Proceedings Article
A unified approach to interpreting model predictions
Scott M. Lundberg,Su-In Lee +1 more
- 04 Dec 2017
TL;DR: In this article, a unified framework for interpreting predictions, SHAP (SHapley Additive exPlanations), is presented, which assigns each feature an importance value for a particular prediction.
•Posted Content
UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
Leland McInnes,John Healy +1 more
TL;DR: The UMAP algorithm is competitive with t-SNE for visualization quality, and arguably preserves more of the global structure with superior run time performance.
9.9K
SCANPY: large-scale single-cell gene expression data analysis
TL;DR: This work presents Scanpy, a scalable toolkit for analyzing single-cell gene expression data that includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks, and AnnData, a generic class for handling annotated data matrices.