Dataset Information

Solving the Problem: Genome Annotation Standards before the Data Deluge.


ABSTRACT: The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high-quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI, in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high-quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.

SUBMITTER: Klimke W 

PROVIDER: S-EPMC3236044 | biostudies-literature | 2011 Oct

REPOSITORIES: biostudies-literature

Publications

Solving the Problem: Genome Annotation Standards before the Data Deluge.

William Klimke, Claire O'Donovan, Owen White, J Rodney Brister, Karen Clark, Boris Fedorov, Ilene Mizrachi, Kim D Pruitt, Tatiana Tatusova

Standards in Genomic Sciences, 2011-10-01, 1


Similar Datasets

| S-EPMC3706742 | biostudies-literature
| S-EPMC3698034 | biostudies-literature
| PRJEB40949 | ENA
| S-EPMC5641385 | biostudies-other
| S-EPMC4523725 | biostudies-literature
| S-EPMC8606407 | biostudies-literature
| S-EPMC7979646 | biostudies-literature
| S-EPMC310723 | biostudies-literature
| S-EPMC7522476 | biostudies-literature
| S-EPMC4163035 | biostudies-literature