Dataset Information

Promoter CpG Density Predicts Downstream Gene Loss-of-Function Intolerance.


ABSTRACT: The aggregation and joint analysis of large numbers of exome sequences has recently made it possible to derive estimates of intolerance to loss-of-function (LoF) variation for human genes. Here, we demonstrate strong and widespread coupling between genic LoF intolerance and promoter CpG density across the human genome. Genes downstream of the most CpG-rich promoters (top 10% CpG density) have a 67.2% probability of being highly LoF intolerant, using the LOEUF metric from gnomAD. This is in contrast to 7.4% of genes downstream of the most CpG-poor (bottom 10% CpG density) promoters. Combining promoter CpG density with exonic and promoter conservation explains 33.4% of the variation in LOEUF, and the contribution of CpG density exceeds the individual contributions of exonic and promoter conservation. We leverage this to train a simple and easily interpretable predictive model that outperforms other existing predictors and allows us to classify 1,760 genes, which are currently unascertained in gnomAD, as highly LoF intolerant or not. These predictions have the potential to aid in the interpretation of novel variants in the clinical setting. Moreover, our results reveal that high CpG density is not merely a generic feature of human promoters but is preferentially encountered at the promoters of the most selectively constrained genes, calling into question the prevailing view that CpG islands are not subject to selection.
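
The decile comparison described in the abstract (share of highly LoF-intolerant genes among the most CpG-rich versus the most CpG-poor promoters) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `intolerant_fraction_by_decile`, the input layout (one `(cpg_density, loeuf)` pair per gene), and the default cutoff of 0.35 for "highly LoF intolerant" are assumptions made for illustration, and the demo values are synthetic, not data from the study.

```python
from typing import List, Tuple


def intolerant_fraction_by_decile(
    genes: List[Tuple[float, float]],
    loeuf_cutoff: float = 0.35,  # assumed cutoff; not stated in this record
) -> Tuple[float, float]:
    """Return (fraction in bottom CpG decile, fraction in top CpG decile)
    of genes whose LOEUF falls below `loeuf_cutoff`.

    `genes` holds one (promoter CpG density, LOEUF) pair per gene.
    """
    if not genes:
        raise ValueError("need at least one gene")

    # Rank genes by promoter CpG density, lowest first.
    ranked = sorted(genes, key=lambda g: g[0])

    # Size of one decile; at least one gene so tiny inputs still work.
    k = max(1, len(ranked) // 10)
    bottom, top = ranked[:k], ranked[-k:]

    def fraction_intolerant(group: List[Tuple[float, float]]) -> float:
        hits = sum(1 for _, loeuf in group if loeuf < loeuf_cutoff)
        return hits / len(group)

    return fraction_intolerant(bottom), fraction_intolerant(top)


# Synthetic demo: 100 made-up genes where only the ten with the highest
# CpG density have a low LOEUF. These values are illustrative only.
toy_genes = [(i / 100, 0.2 if i >= 90 else 0.9) for i in range(100)]
bottom_frac, top_frac = intolerant_fraction_by_decile(toy_genes)
print(bottom_frac, top_frac)
```

Applied to real per-gene values, the two returned fractions would correspond to the 7.4% and 67.2% figures quoted above, provided the same definitions of CpG density and LoF intolerance are used.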

SUBMITTER: Boukas L 

PROVIDER: S-EPMC7477270 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

Publications

Promoter CpG Density Predicts Downstream Gene Loss-of-Function Intolerance.

Leandros Boukas, Hans T. Bjornsson, Kasper D. Hansen

American Journal of Human Genetics, 2020-08-14, issue 3


Similar Datasets

| S-EPMC2666042 | biostudies-literature
| S-EPMC3973331 | biostudies-literature
| S-EPMC8329181 | biostudies-literature
| S-EPMC3277105 | biostudies-literature
| S-EPMC5528631 | biostudies-literature
| S-EPMC4557908 | biostudies-literature
| S-EPMC6027561 | biostudies-literature
| S-EPMC2775588 | biostudies-literature
| S-EPMC4438505 | biostudies-literature
| S-ECPF-GEOD-15154 | biostudies-other