Phylogenetic profiling

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1126/SCIENCE.278.5338.631•

A genomic perspective on protein families

[...]

Roman L. Tatusov¹, Eugene V. Koonin¹, David J. Lipman¹•Institutions (1)

National Institutes of Health¹

24 Oct 1997-Science

TL;DR: Comparison of proteins encoded in seven complete genomes from five major phylogenetic lineages and elucidation of consistent patterns of sequence similarities allowed the delineation of 720 clusters of orthologous groups (COGs), which comprise a framework for functional and evolutionary genome analysis.

...read moreread less

Abstract: In order to extract the maximum amount of information from the rapidly accumulating genome sequences, all conserved genes need to be classified according to their homologous relationships. Comparison of proteins encoded in seven complete genomes from five major phylogenetic lineages and elucidation of consistent patterns of sequence similarities allowed the delineation of 720 clusters of orthologous groups (COGs). Each COG consists of individual orthologous proteins or orthologous sets of paralogs from at least three lineages. Orthologs typically have the same function, allowing transfer of functional information from one member to an entire COG. This relation automatically yields a number of functional predictions for poorly characterized genomes. The COGs comprise a framework for functional and evolutionary genome analysis.

...read moreread less

3,812 citations

Patent•10.1073/PNAS.96.8.4285•

Assigning protein functions by comparative genome analysis protein phylogenetic profiles

[...]

Matteo Pellegrini¹, Edward M. Marcotte¹, Michael J. Thompson¹, David Eisenberg¹, Robert Grothe¹, Todd O. Yeates¹ - Show less +2 more•Institutions (1)

University of California¹

28 Jan 2000-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this paper, a computational method system and computer program are provided for inferring functional links from genome sequences, based on the observation that some pairs of proteins A′ and B′ have homologs in another organism fused into a single protein chain AB.

...read moreread less

Abstract: A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A′ and B′ have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A′ and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

...read moreread less

1,976 citations

Journal Article•10.1126/SCIENCE.285.5428.751•

Detecting Protein Function and Protein-Protein Interactions from Genome Sequences

[...]

Edward M. Marcotte¹, Matteo Pellegrini¹, Ho Leung Ng¹, Danny W. Rice¹, Todd O. Yeates¹, David Eisenberg¹ - Show less +2 more•Institutions (1)

University of California, Los Angeles¹

30 Jul 1999-Science

TL;DR: Searching sequences from many genomes revealed 6809 putative protein-protein interactions in Escherichia coli and 45,502 in yeast, and many members of these pairs were confirmed as functionally related; computational filtering further enriches for interactions.

...read moreread less

Abstract: A computational method is proposed for inferring protein interactions from genome sequences on the basis of the observation that some pairs of interacting proteins have homologs in another organism fused into a single protein chain. Searching sequences from many genomes revealed 6809 such putative proteinprotein interactions in Escherichia coli and 45,502 in yeast. Many members of these pairs were confirmed as functionally related; computational filtering further enriches for interactions. Some proteins have links to several other proteins; these coupled links appear to represent functional interactions such as complexes or pathways. Experimentally confirmed interacting pairs are documented in a Database of Interacting Proteins.

...read moreread less

1,796 citations

Journal Article•10.1073/PNAS.96.6.2896•

The use of gene clusters to infer functional coupling

[...]

Ross Overbeek¹, Michael Fonstein, Mark D'Souza, Gordon D. Pusch, Natalia Maltsev - Show less +1 more•Institutions (1)

Argonne National Laboratory¹

16 Mar 1999-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The characterization of the parameters that determine the utility of the approach are extended, and it is shown that this approach will play a significant role in supporting efforts to assign functionality to the remaining uncharacterized genes in sequenced genomes.

...read moreread less

Abstract: Previously, we presented evidence that it is possible to predict functional coupling between genes based on conservation of gene clusters between genomes. With the rapid increase in the availability of prokaryotic sequence data, it has become possible to verify and apply the technique. In this paper, we extend our characterization of the parameters that determine the utility of the approach, and we generalize the approach in a way that supports detection of common classes of functionally coupled genes (e.g., transport and signal transduction clusters). Now that the analysis includes over 30 complete or nearly complete genomes, it has become clear that this approach will play a significant role in supporting efforts to assign functionality to the remaining uncharacterized genes in sequenced genomes.

...read moreread less

1,364 citations

Journal Article•10.7554/ELIFE.65088•

Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with biobakery 3

[...]

Francesco Beghini¹, Lauren J. McIver², Aitor Blanco-Míguez¹, Leonard Dubois¹, Francesco Asnicar¹, Sagun Maharjan³, Sagun Maharjan², Ana Mailyan³, Ana Mailyan², Paolo Manghi¹, Matthias Scholz⁴, Andrew Maltez Thomas¹, Mireia Valles-Colomer¹, George Weingart², George Weingart³, Yancong Zhang³, Yancong Zhang², Moreno Zolfo¹, Curtis Huttenhower³, Curtis Huttenhower², Eric A. Franzosa², Eric A. Franzosa³, Nicola Segata¹, Nicola Segata⁵ - Show less +20 more•Institutions (5)

University of Trento¹, Harvard University², Broad Institute³, Edmund Mach Foundation⁴, European Institute of Oncology⁵

04 May 2021-eLife

TL;DR: BioBakery 3 as mentioned in this paper is a set of integrated, improved methods for taxonomic, strain-level, functional, and phylogenetic profiling of metagenomes newly developed to build on the largest set of reference sequences now available.

...read moreread less

Abstract: Culture-independent analyses of microbial communities have progressed dramatically in the last decade, particularly due to advances in methods for biological profiling via shotgun metagenomics. Opportunities for improvement continue to accelerate, with greater access to multi-omics, microbial reference genomes, and strain-level diversity. To leverage these, we present bioBakery 3, a set of integrated, improved methods for taxonomic, strain-level, functional, and phylogenetic profiling of metagenomes newly developed to build on the largest set of reference sequences now available. Compared to current alternatives, MetaPhlAn 3 increases the accuracy of taxonomic profiling, and HUMAnN 3 improves that of functional potential and activity. These methods detected novel disease-microbiome links in applications to CRC (1262 metagenomes) and IBD (1635 metagenomes and 817 metatranscriptomes). Strain-level profiling of an additional 4077 metagenomes with StrainPhlAn 3 and PanPhlAn 3 unraveled the phylogenetic and functional structure of the common gut microbe Ruminococcus bromii, previously described by only 15 isolate genomes. With open-source implementations and cloud-deployable reproducible workflows, the bioBakery 3 platform can help researchers deepen the resolution, scale, and accuracy of multi-omic profiling for microbial community studies.

...read moreread less

1,354 citations

...

Expand

Year	Papers
2021	12
2020	6
2019	5
2018	5
2017	10
2016	2

Topic Tools

Papers published on a yearly basis

Papers

A genomic perspective on protein families

Assigning protein functions by comparative genome analysis protein phylogenetic profiles

Detecting Protein Function and Protein-Protein Interactions from Genome Sequences

The use of gene clusters to infer functional coupling

Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with biobakery 3

Related Topics (5)

Performance Metrics