Open AccessBook
Machine Learning for Microbial Phenotype Prediction
Roman Feldbauer
- 15 Jun 2016
4
TL;DR: The results suggest that the presented prediction tool can be used to automatically annotate phenotypes in near-complete microbial genome sequences, as generated in large numbers in current metagenomic studies.
read more
Abstract: This thesis presents a scalable, generic methodology for microbial phenotype prediction based on supervised machine learning, several models for biological and ecological traits of high relevance, and the deployment in metagenomic datasets. The results suggest that the presented prediction tool can be used to automatically annotate phenotypes in near-complete microbial genome sequences, as generated in large numbers in current metagenomic studies. Unraveling relationships between a living organism's genetic information and its observable traits is a central biological problem. Phenotype prediction facilitated by machine learning techniques will be a major step forward to creating biological knowledge from big data. Contents Microbial Genotypes and Phenotypes Basics of Machine Learning Phenotype Prediction Packages A Model for Intracellular Lifestyle Target Groups Teachers and students in the fields of bioinformatics, molecular biology and microbiology Executives and specialists in the field of microbiology, computational biology and machine learning About the Author Roman Feldbauer is currently employed at the Austrian Research Institute for Artificial Intelligence (OFAI) and PhD student at the University of Vienna. His research interests are machine learning, data science, bioinformatics, comparative genomics and neuroscience. In one of his current projects he investigates large biological databases in regard to the curse of dimensionality
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Automated identification of Myxobacterial genera using Convolutional Neural Network.
TL;DR: All 30 genera of this order could be recognized based on stereomicroscopic images of the fruiting bodies at the genus level that not only does not urge us to amplify and sequence gene but also can be attained without preparation of microscopic slides of the vegetative cells or myxospores.
A New Phylogeny-Driven Random Forest-Based Classification Approach for Functional Metagenomics
06 Dec 2022
TL;DR: In this article , a new classification method based on RF as guided by the evolutionary ancestry of microbial phylogeny, i.e. Phylogeny-RF, has been developed in order to capture the effects of phylogenetic relatedness in a ML classifier itself.
1
A New Phylogeny-Driven Random Forest-Based Classification Approach for Functional Metagenomics
Jyotsna Talreja Wassan,Haiying Wang,Huiru Zheng +2 more
- 06 Dec 2022
TL;DR: In this paper , a new classification method based on RF as guided by the evolutionary ancestry of microbial phylogeny, i.e. Phylogeny-RF, has been developed in order to capture the effects of phylogenetic relatedness in a ML classifier itself.
1
•Book Chapter
Inferring orthology and paralogy.
Adrian M. Altenhoff,Adrian M. Altenhoff,Christophe Dessimoz,Christophe Dessimoz +3 more
- 01 Jan 2012
TL;DR: This chapter provides an overview of the methods used to infer orthology and paralogy, and surveys both graph-based approaches (and their various grouping strategies) and tree- based approaches, which solve the more general problem of gene/species tree reconciliation.
References
Pattern Recognition and Machine Learning
TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.
30.8K
An obesity-associated gut microbiome with increased capacity for energy harvest
Peter J. Turnbaugh,Ruth E. Ley,Michael A. Mahowald,Vincent Magrini,Elaine R. Mardis,Jeffrey I. Gordon +5 more
TL;DR: It is demonstrated through metagenomic and biochemical analyses that changes in the relative abundance of the Bacteroidetes and Firmicutes affect the metabolic potential of the mouse gut microbiota and indicates that the obese microbiome has an increased capacity to harvest energy from the diet.
12K
Prodigal: prokaryotic gene recognition and translation initiation site identification
Doug Hyatt,Doug Hyatt,Gwo Liang Chen,Philip F. LoCascio,Miriam Land,Frank W. Larimer,Frank W. Larimer,Loren Hauser +7 more
TL;DR: This work developed a new gene prediction algorithm called Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm), which achieved good results compared to existing methods, and it is believed it will be a valuable asset to automated microbial annotation pipelines.
STRING v10: protein–protein interaction networks, integrated over the tree of life
Damian Szklarczyk,Andrea Franceschini,Stefan Wyder,Kristoffer Forslund,Davide Heller,Jaime Huerta-Cepas,Milan Simonovic,Alexander Roth,Alberto Santos,Kalliopi Tsafou,Michael Kuhn,Peer Bork,Lars Juhl Jensen,Christian von Mering +13 more
TL;DR: H hierarchical and self-consistent orthology annotations are introduced for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution in the STRING database.
Phylogenetic identification and in situ detection of individual microbial cells without cultivation.
TL;DR: Phylogenetic analysis of the retrieved rRNA sequence of an uncultured microorganism reveals its closest culturable relatives and may, together with information on the physicochemical conditions of its natural habitat, facilitate more directed cultivation attempts.
9.3K