Sequence Ontology

Topic Tools

Papers

Journal Article•10.1038/75556•

Gene Ontology: tool for the unification of biology

[...]

M Ashburner¹, Catherine A. Ball, Judith A. Blake, David Botstein, Heather Butler, J. M. Cherry, Allan Peter Davis, Kara Dolinski, Selina S. Dwight, J.T. Eppig, Midori A. Harris, David P. Hill, Laurie Issel-Tarver, Andrew Kasarskis, Suzanna E. Lewis, John C. Matese, Joel E. Richardson, M. Ringwald, Gerald M. Rubin, Gavin Sherlock - Show less +16 more•Institutions (1)

Stanford University¹

01 May 2000-Nature Genetics

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.

...read moreread less

Abstract: Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.

...read moreread less

42,596 citations

Journal Article•10.1038/NBT1346•

The OBO Foundry : coordinated evolution of ontologies to support biomedical data integration

[...]

Barry Smith¹, Michael Ashburner², Cornelius Rosse³, Jonathan Bard⁴, William J. Bug⁵, Werner Ceusters¹, Louis J. Goldberg¹, Karen Eilbeck⁶, Amelia Ireland⁷, Christopher J. Mungall⁸, Neocles B. Leontis⁹, Philippe Rocca-Serra⁷, Alan Ruttenberg¹⁰, Susanna-Assunta Sansone⁷, Richard H. Scheuermann¹¹, Nigam H. Shah¹², Patricia L. Whetzel¹³, Suzanna E. Lewis⁸ - Show less +14 more•Institutions (13)

University at Buffalo¹, University of Cambridge², University of Washington³, University of Edinburgh⁴, Drexel University⁵, University of Utah⁶, European Bioinformatics Institute⁷, Lawrence Berkeley National Laboratory⁸, Bowling Green State University⁹, Massachusetts Institute of Technology¹⁰, University of Texas Southwestern Medical Center¹¹, Stanford University¹², University of Pennsylvania¹³

01 Nov 2007-Nature Biotechnology

TL;DR: This work describes the OBO Foundry initiative and provides guidelines for those who might wish to become involved and describes an expanding family of ontologies designed to be interoperable and logically well formed and to incorporate accurate representations of biological reality.

...read moreread less

Abstract: The value of any kind of data is greatly enhanced when it exists in a form that allows it to be integrated with other data. One approach to integration is through the annotation of multiple bodies of data using common controlled vocabularies or 'ontologies'. Unfortunately, the very success of this approach has led to a proliferation of ontologies, which itself creates obstacles to integration. The Open Biomedical Ontologies (OBO) consortium is pursuing a strategy to overcome this problem. Existing OBO ontologies, including the Gene Ontology, are undergoing coordinated reform, and new ontologies are being created on the basis of an evolving set of shared principles governing ontology development. The result is an expanding family of ontologies designed to be interoperable and logically well formed and to incorporate accurate representations of biological reality. We describe this OBO Foundry initiative and provide guidelines for those who might wish to become involved.

...read moreread less

2,845 citations

Journal Article•10.1186/GB-2005-6-5-R44•

The Sequence Ontology: a tool for the unification of genome annotations

[...]

Karen Eilbeck¹, Suzanna E. Lewis¹, Christopher J. Mungall¹, Mark Yandell¹, Lincoln Stein², Richard Durbin³, Michael Ashburner⁴ - Show less +3 more•Institutions (4)

University of California, Berkeley¹, Cold Spring Harbor Laboratory², Wellcome Trust Sanger Institute³, University of Cambridge⁴

29 Apr 2005-Genome Biology

TL;DR: The Sequence Ontology is a structured controlled vocabulary for the parts of a genomic annotation that provides a common set of terms and definitions that will facilitate the exchange, analysis and management of genomic data.

...read moreread less

Abstract: The Sequence Ontology (SO) is a structured controlled vocabulary for the parts of a genomic annotation. SO provides a common set of terms and definitions that will facilitate the exchange, analysis and management of genomic data. Because SO treats part-whole relationships rigorously, data described with it can become substrates for automated reasoning, and instances of sequence features described by the SO can be subjected to a group of logical operations termed extensional mereology operators.

...read moreread less

891 citations

Journal Article•10.1093/NAR/GKM883•

The Gene Ontology project in 2008

[...]

Midori A. Harris, Jennifer I. Deegan, Amelia Ireland, Jane Lomax, Michael Ashburner¹, Susan Tweedie¹, Seth Carbon², Suzanna E. Lewis², Christopher J. Mungall², John Day Richter², Karen Eilbeck, Judith A. Blake, Carol J. Bult, Alexander D. Diehl, Mary E. Dolan, Harold J. Drabkin, Janan T. Eppig, David P. Hill, Ni Li, Martin Ringwald, Rama Balakrishnan³, Gail Binkley³, J. Michael Cherry³, Karen R. Christie³, Maria C. Costanzo³, Qing Dong³, Stacia R. Engel³, Dianna G. Fisk³, Jodi E. Hirschman³, Benjamin C. Hitz³, Eurie L. Hong³, Cynthia J. Krieger³, Stuart R. Miyasato³, Robert S. Nash³, Julie Park³, Marek S. Skrzypek³, Shuai Weng³, Edith D. Wong³, Kathy K. Zhu³, David Botstein⁴, Kara Dolinski⁴, Michael S. Livstone⁴, Rose Oughtred⁴, Tanya Z. Berardini⁵, Li Donghui⁵, Seung Y. Rhee⁵, Rolf Apweiler⁶, Daniel Barrell⁶, Evelyn Camon⁶, Emily Dimmer⁶, Rachael P. Huntley, Nicola Mulder, Varsha K. Khodiyar, Ruth C. Lovering, Sue Povey, Rex L. Chisholm, Petra Fey, Pascale Gaudet, Warren A. Kibbe, Ranjana Kishore, Erich M. Schwarz, Paul W. Sternberg, Kimberly Van Auken, Michelle G. Giglio, Linda Hannick, Jennifer R. Wortman, Martin Aslett, Matthew Berriman, Valerie Wood, Howard J. Jacob, Stan Laulederkind, Victoria Petri, Mary Shimoyama, Jennifer L. Smith, Simon N. Twigger, Pankaj Jaiswal, Trent E. Seigfried, Doug Howe, Monte Westerfield, Candace Collmer, Trudy Torto Alalibo, Erika Feltrin, Giorgio Valle, Susan Bromberg, Shane C. Burgess, Fiona M. McCarthy - Show less +82 more•Institutions (6)

University of Cambridge¹, Lawrence Berkeley National Laboratory², Stanford University³, Princeton University⁴, Carnegie Institution for Science⁵, European Bioinformatics Institute⁶

01 Jan 2008-Nucleic Acids Research

TL;DR: The GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of ‘reference’ genomes, including human and several key model organisms.

...read moreread less

Abstract: The Gene Ontology (GO) project (http://www.geneontology.org) provides a set of structured, controlled vocabularies for community use in annotating genes, gene products and sequences (also see http://www.sequenceontology.org/). The ontologies have been extended and refined for several biological areas, and improvements to the structure of the ontologies have been implemented. To improve the quantity and quality of gene product annotations available from its public repository, the GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of reference genomes, including human and several key model organisms. Software developments include two releases of the ontology-editing tool OBO-Edit, and improvements to the AmiGO browser interface.

...read moreread less

826 citations

Journal Article•10.1186/1471-2105-13-161•

Concept annotation in the CRAFT corpus

[...]

Michael Bada¹, Miriam R. Eckert², Donald Evans¹, Kristin Garcia¹, Krista Shipley¹, Dmitry Sitnikov, William A. Baumgartner¹, K. Bretonnel Cohen¹, Karin Verspoor¹, Judith A. Blake, Lawrence Hunter¹ - Show less +7 more•Institutions (2)

Anschutz Medical Campus¹, University of Colorado Boulder²

09 Jul 2012-BMC Bioinformatics

TL;DR: The concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems.

...read moreread less

Abstract: Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml .

...read moreread less

302 citations

...

Expand

Year	Papers
2021	3
2019	1
2017	1
2016	3
2015	2
2014	1

Topic Tools

Papers

Gene Ontology: tool for the unification of biology

The OBO Foundry : coordinated evolution of ontologies to support biomedical data integration

The Sequence Ontology: a tool for the unification of genome annotations

The Gene Ontology project in 2008

Concept annotation in the CRAFT corpus

Related Topics (5)

Performance Metrics