Dataset Information

RefSeq: an update on mammalian reference sequences.


ABSTRACT: The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.nih.gov/refseq/). We report here on growth of the mammalian and human subsets, changes to NCBI's eukaryotic annotation pipeline and modifications affecting transcript and protein records. Recent changes to NCBI's eukaryotic genome annotation pipeline provide higher throughput, and the addition of RNAseq data to the pipeline results in a significant expansion of the number of transcripts and novel exons annotated on mammalian RefSeq genomes. Recent annotation changes include reporting supporting evidence for transcript records, modification of exon feature annotation and the addition of a structured report of gene and sequence attributes of biological interest. We also describe a revised protein annotation policy for alternatively spliced transcripts with more divergent predicted proteins and we summarize the current status of the RefSeqGene project.

SUBMITTER: Pruitt KD 

PROVIDER: S-EPMC3965018 | biostudies-literature | 2014 Jan

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5753331 | biostudies-literature
| S-EPMC4702849 | biostudies-literature
| S-EPMC7660235 | biostudies-literature
| S-EPMC4189044 | biostudies-literature
| S-EPMC3712216 | biostudies-literature
| S-EPMC8744684 | biostudies-literature
| S-EPMC4502323 | biostudies-literature
| PRJEB6684 | ENA
| S-EPMC102393 | biostudies-literature
| S-EPMC4064129 | biostudies-literature