Unknown

Dataset Information

0

CADD: predicting the deleteriousness of variants throughout the human genome.


ABSTRACT: Combined Annotation-Dependent Depletion (CADD) is a widely used measure of variant deleteriousness that can effectively prioritize causal variants in genetic analyses, particularly highly penetrant contributors to severe Mendelian disorders. CADD is an integrative annotation built from more than 60 genomic features, and can score human single nucleotide variants and short insertion and deletions anywhere in the reference assembly. CADD uses a machine learning model trained on a binary distinction between simulated de novo variants and variants that have arisen and become fixed in human populations since the split between humans and chimpanzees; the former are free of selective pressure and may thus include both neutral and deleterious alleles, while the latter are overwhelmingly neutral (or, at most, weakly deleterious) by virtue of having survived millions of years of purifying selection. Here we review the latest updates to CADD, including the most recent version, 1.4, which supports the human genome build GRCh38. We also present updates to our website that include simplified variant lookup, extended documentation, an Application Program Interface and improved mechanisms for integrating CADD scores into other tools or applications. CADD scores, software and documentation are available at https://cadd.gs.washington.edu.

SUBMITTER: Rentzsch P 

PROVIDER: S-EPMC6323892 | biostudies-other | 2019 Jan

REPOSITORIES: biostudies-other

altmetric image

Publications

CADD: predicting the deleteriousness of variants throughout the human genome.

Rentzsch Philipp P   Witten Daniela D   Cooper Gregory M GM   Shendure Jay J   Kircher Martin M  

Nucleic acids research 20190101 D1


Combined Annotation-Dependent Depletion (CADD) is a widely used measure of variant deleteriousness that can effectively prioritize causal variants in genetic analyses, particularly highly penetrant contributors to severe Mendelian disorders. CADD is an integrative annotation built from more than 60 genomic features, and can score human single nucleotide variants and short insertion and deletions anywhere in the reference assembly. CADD uses a machine learning model trained on a binary distinctio  ...[more]

Similar Datasets

| S-EPMC1802602 | biostudies-literature
| S-EPMC8948160 | biostudies-literature
| S-BSMS2 | biostudies-other
| S-EPMC3979972 | biostudies-literature
| S-EPMC7214033 | biostudies-literature
| S-EPMC2775593 | biostudies-literature
| S-EPMC4896702 | biostudies-other
| S-EPMC6280872 | biostudies-other
| S-EPMC2954820 | biostudies-literature
| S-EPMC4416779 | biostudies-literature