Using Network-Based Machine Learning to Predict Transcription Factors Involved in Drought Resistance
TL;DR: The Gene Regulation and Association Network (GRAiN) is an interactive, query-based web platform for studying functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses in rice.
Abstract: Gene regulatory networks underpin stress response pathways in plants. However, parsing these networks to prioritize key genes underlying a particular trait is challenging. Here, we have built the Gene Regulation and Association Network (GRAiN) of rice (Oryza sativa). GRAiN is an interactive query-based web-platform that allows users to study functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses. We built GRAiN by applying a combination of different network inference algorithms to publicly available gene expression data. We propose a supervised machine learning framework that complements GRAiN in prioritizing genes that regulate stress signal transduction and modulate gene expression under drought conditions. Our framework converts intricate network connectivity patterns of 2160 TFs into a single drought score. We observed that TFs with the highest drought scores define the functional, structural, and evolutionary characteristics of drought resistance in rice. Our approach accurately predicted the function of the OsbHLH148 TF, which we validated using in vitro protein-DNA binding assays and mRNA sequencing of loss-of-function mutants grown under control and drought stress conditions. Our network and the complementary machine learning strategy lend themselves to predicting key regulatory genes underlying other agricultural traits and will assist in the genetic engineering of desirable rice varieties.
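The idea of collapsing each TF's network connectivity into a single score can be illustrated with a minimal sketch. It is not the authors' pipeline: the TF names, the TF-to-module edge weights, the labels, and the choice of plain logistic regression are all assumptions made for illustration, and the abstract does not say which classifier or features the authors used.

```python
import math

def sigmoid(z):
    """Logistic function mapping a real value to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def dot(w, x):
    """Inner product of two equal-length lists."""
    return sum(wi * xi for wi, xi in zip(w, x))

def train_logistic(X, y, lr=0.5, epochs=2000, l2=0.01):
    """Fit L2-regularized logistic regression by batch gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(dot(w, xi) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        for j in range(d):
            w[j] -= lr * (grad_w[j] / n + l2 * w[j])
        b -= lr * grad_b / n
    return w, b

# Hypothetical connectivity features: one row per TF, one column per
# gene module, each value an assumed TF-to-module edge weight.
edges = {
    "TF_A": [0.90, 0.10, 0.20],
    "TF_B": [0.80, 0.20, 0.10],
    "TF_C": [0.10, 0.70, 0.30],
    "TF_D": [0.20, 0.90, 0.20],
    "TF_E": [0.85, 0.15, 0.30],  # unlabeled
    "TF_F": [0.10, 0.80, 0.10],  # unlabeled
}

# Hypothetical training labels: 1 = known drought-associated TF, 0 = not.
labels = {"TF_A": 1, "TF_B": 1, "TF_C": 0, "TF_D": 0}

X_train = [edges[tf] for tf in labels]
y_train = [labels[tf] for tf in labels]
weights, bias = train_logistic(X_train, y_train)

# The "drought score" here is the predicted probability for every TF,
# including the unlabeled ones that we want to prioritize.
scores = {tf: sigmoid(dot(weights, feats) + bias) for tf, feats in edges.items()}
ranking = sorted(scores, key=scores.get, reverse=True)

for tf in ranking:
    print(f"{tf}\t{scores[tf]:.3f}")
```

In this toy setting, unlabeled TFs whose connectivity resembles that of the labeled drought-associated TFs rank higher. A real analysis would need held-out evaluation and far more labeled examples than this sketch uses.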
Citations
Machine learning bridges omics sciences and plant breeding.
Jun Yan, Xiangfeng Wang
TL;DR: This paper reviews ML for multi-omics analysis in plants, including data dimensionality reduction, inference of gene-regulation networks, and gene discovery and prioritization.
79
Artificial intelligence in plant breeding
Muhammad Amjad Farooq, Shang Gao, Muhammad Adeel Hassan, Zhangping Huang, Awais Rasheed, Sarah Hearne, B. M. Prasanna, Xinhai Li, Huihui Li
TL;DR: Artificial intelligence is revolutionizing plant breeding by leveraging big data analysis, unlocking genetic diversity, and bridging genotype-phenotype gaps to develop tailored crop cultivars, refine crop traits, and optimize cropping systems for enhanced agricultural sustainability and productivity.
28
Modelling agricultural drought: a review of latest advances in big data technologies
TL;DR: This paper reviews the main recent applications of multi-sensor remote sensing and artificial intelligence techniques in multivariate modelling of agricultural drought, focusing on three fundamental aspects: descriptive modelling, predictive modelling, and spatial modelling of expected risks and vulnerability to drought.
24
Genetic Dissection of Grain Yield Component Traits Under High Nighttime Temperature Stress in a Rice Diversity Panel.
TL;DR: In this article, a diverse panel of 190 rice accessions of the USDA rice mini-core (URMC) was treated with high nighttime temperature (HNT) stress at the reproductive stage of panicle initiation, and quantifiable yield component response traits were measured.
Data-driven approaches to improve water-use efficiency and drought resistance in crop plants.
Niharika Sharma, Harsh Raman, David Wheeler, Yogendra Kalenahalli, Rita Sharma
TL;DR: Researchers leverage big data, omics technologies, and artificial intelligence to improve crop water-use efficiency and drought resistance, identifying genetic markers and superior haplotypes for breeding programs and developing drought-tolerant crop varieties with high-yield potential.
11