Dataset Information

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

ABSTRACT: The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community.

SUBMITTER: Pujar S

PROVIDER: S-EPMC5753299 | biostudies-literature | 2018 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Pujar Shashikant S O'Leary Nuala A NA Farrell Catherine M CM Loveland Jane E JE Mudge Jonathan M JM Wallin Craig C Girón Carlos G CG Diekhans Mark M Barnes If I Bennett Ruth R Berry Andrew E AE Cox Eric E Davidson Claire C Goldfarb Tamara T Gonzalez Jose M JM Hunt Toby T Jackson John J Joardar Vinita V Kay Mike P MP Kodali Vamsi K VK Martin Fergal J FJ McAndrews Monica M McGarvey Kelly M KM Murphy Michael M Rajput Bhanu B Rangwala Sanjida H SH Riddick Lillian D LD Seal Ruth L RL Suner Marie-Marthe MM Webb David D Zhu Sophia S Aken Bronwen L BL Bruford Elspeth A EA Bult Carol J CJ Frankish Adam A Murphy Terence T Pruitt Kim D KD

Nucleic acids research 20180101 D1

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are ge ...[more]

PMID: 29126148

Similar Datasets

Project description:ObjectivesMonitoring the appropriateness of antibiotic prescriptions with indicators based on reimbursement data is required to guide antibiotic stewardship (AMS) interventions in nursing homes (NHs). Quantity metrics (QMs) monitor the volume of prescriptions while proxy indicators (PIs) reflect the appropriateness of antibiotic use. Our objectives were: (i) to provide a relevant consensual set of indicators to be used in French NHs; and (ii) to assess the feasibility of their implementation at the national and local scale.MethodsNine French professional organizations implicated in AMS in NHs were asked to nominate at least one member to create a national expert panel of 20 physicians. Twenty-one recently published QMs and 11 PIs were assessed by the expert panel. Indicators were evaluated using a RAND-modified Delphi procedure comprising two online surveys and a videoconference meeting. Indicators were kept in the final list if >70% of stakeholders validated their relevance for estimating the volume (QMs) and appropriateness (PIs) of prescriptions.ResultsOf the 21 QM indicators submitted to the panel, 14 were selected, describing the consumption of antibiotics overall (n = 3), broad-spectrum (n = 6) and second-line antibiotics (n = 2). The three remaining QMs evaluated the route of administration (n = 1) and urine culture prescriptions (n = 2). Ten PIs (six modified, two rejected, one new) were selected to assess the appropriateness of prescriptions for urinary tract infections (n = 2), seasonal variations in prescriptions (n = 2), repeated prescriptions of fluoroquinolones (n = 1), cephalosporins' route of administration (n = 1), duration of treatment (n = 1), rate of second-line antibiotics (n = 1), co-prescriptions with non-steroidal anti-inflammatory drugs (n = 1), and flu vaccine coverage (n = 1). The panel was in favour of using these indicators for regional and facility level AMS programmes (91%), feedback to NH prescribers (82%), benchmarking by health authorities (55%) and public reporting at the facility level (9%).ConclusionsThis consensual list of indicators, covering a wide range of frequent clinical situations, may be used as part of the French national AMS strategy for monitoring antibiotic prescriptions in NHs at the national and local levels. Regional AMS networks might manage this selected list to guide personalized action plans with concrete objectives of reducing the quantity and improving the quality of antibiotic prescriptions.

Dataset Information

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Publications

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets