Concepedia

Publication | Open Access

Creating the Gene Ontology Resource: Design and Implementation

957

Citations

14

References

2001

Year

TLDR

The rapid expansion of biological data has created confusion over gene annotation, prompting the GO Consortium—formed by major model organism databases and now joined by others—to unify terminology. The GO project aims to supply structured vocabularies for molecular function, biological process, and cellular component that can describe gene products across all organisms. The authors built three ontologies, integrated annotations from model organism databases into a shared public resource at geneontology.org, and provided tools for curators and researchers to query and manipulate the vocabularies. The shared development of this annotation resource is expected to unify biological information.

Abstract

The exponential growth in the volume of accessible biological information has generated a confusion of voices surrounding the annotation of molecular information about genes and their products. The Gene Ontology (GO) project seeks to provide a set of structured vocabularies for specific biological domains that can be used to describe gene products in any organism. This work includes building three extensive ontologies to describe molecular function, biological process, and cellular component, and providing a community database resource that supports the use of these ontologies. The GO Consortium was initiated by scientists associated with three model organism databases: SGD, the Saccharomyces Genome database; FlyBase, the Drosophila genome database; and MGD/GXD, the Mouse Genome Informatics databases. Additional model organism database groups are joining the project. Each of these model organism information systems is annotating genes and gene products using GO vocabulary terms and incorporating these annotations into their respective model organism databases. Each database contributes its annotation files to a shared GO data resource accessible to the public at http://www.geneontology.org/. The GO site can be used by the community both to recover the GO vocabularies and to access the annotated gene product data sets from the model organism databases. The GO Consortium supports the development of the GO database resource and provides tools enabling curators and researchers to query and manipulate the vocabularies. We believe that the shared development of this molecular annotation resource will contribute to the unification of biological information.

References

YearCitations

Page 1