Unknown

Dataset Information

0

Extreme purifying selection against point mutations in the human genome.


ABSTRACT: Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of "ultraselection" by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we find abundant ultraselection in evolutionarily ancient miRNAs and neuronal protein-coding genes, as well as at splice sites. By contrast, we find much less ultraselection in other noncoding RNAs and transcription factor binding sites, and only modest levels in ultraconserved elements. We estimate that ~0.4-0.7% of the human genome is ultraselected, implying ~ 0.26-0.51 strongly deleterious mutations per generation. Overall, our study sheds new light on the genome-wide distribution of fitness effects by combining deep sequencing data and classical theory from population genetics.

SUBMITTER: Dukler N 

PROVIDER: S-EPMC9314448 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extreme purifying selection against point mutations in the human genome.

Dukler Noah N   Mughal Mehreen R MR   Ramani Ritika R   Huang Yi-Fei YF   Siepel Adam A  

Nature communications 20220725 1


Large-scale genome sequencing has enabled the measurement of strong purifying selection in protein-coding genes. Here we describe a new method, called ExtRaINSIGHT, for measuring such selection in noncoding as well as coding regions of the human genome. ExtRaINSIGHT estimates the prevalence of "ultraselection" by the fractional depletion of rare single-nucleotide variants, after controlling for variation in mutation rates. Applying ExtRaINSIGHT to 71,702 whole genome sequences from gnomAD v3, we  ...[more]

Similar Datasets

| S-EPMC10544217 | biostudies-literature
| S-EPMC1941483 | biostudies-literature
| S-EPMC7593775 | biostudies-literature
| S-EPMC3499406 | biostudies-literature
| S-EPMC11615325 | biostudies-literature
| S-EPMC4091738 | biostudies-literature
| S-EPMC2577867 | biostudies-literature
| S-EPMC6798728 | biostudies-literature
| S-EPMC2667980 | biostudies-literature
| S-EPMC2694979 | biostudies-literature