Unknown

Dataset Information

0

Whole-exome sequencing provides insights into monogenic disease prevalence in Northwest Russia.


ABSTRACT: BACKGROUND:Allele frequency data from large exome and genome aggregation projects such as the Genome Aggregation Database (gnomAD) are of ultimate importance to the interpretation of medical resequencing data. However, allele frequencies might significantly differ in poorly studied populations that are underrepresented in large-scale projects, such as the Russian population. METHODS:In this work, we leveraged our access to a large dataset of 694 exome samples to analyze genetic variation in the Northwest Russia. We compared the spectrum of genetic variants to the dbSNP build 151, and made estimates of ClinVar-based autosomal recessive (AR) disease allele prevalence as compared to gnomAD r. 2.1. RESULTS:An estimated 9.3% of discovered variants were not present in dbSNP. We report statistically significant overrepresentation of pathogenic variants for several Mendelian disorders, including phenylketonuria (PAH, rs5030858), Wilson's disease (ATP7B, rs76151636), factor VII deficiency (F7, rs36209567), kyphoscoliosis type of Ehlers-Danlos syndrome (FKBP14, rs542489955), and several other recessive pathologies. We also make primary estimates of monogenic disease incidence in the population, with retinal dystrophy, cystic fibrosis, and phenylketonuria being the most frequent AR pathologies. CONCLUSION:Our observations demonstrate the utility of population-specific allele frequency data to the diagnosis of monogenic disorders using high-throughput technologies.

SUBMITTER: Barbitoff YA 

PROVIDER: S-EPMC6825859 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications


<h4>Background</h4>Allele frequency data from large exome and genome aggregation projects such as the Genome Aggregation Database (gnomAD) are of ultimate importance to the interpretation of medical resequencing data. However, allele frequencies might significantly differ in poorly studied populations that are underrepresented in large-scale projects, such as the Russian population.<h4>Methods</h4>In this work, we leveraged our access to a large dataset of 694 exome samples to analyze genetic va  ...[more]

Similar Datasets

| S-EPMC5843550 | biostudies-literature
| S-EPMC8440967 | biostudies-literature
| S-EPMC4406825 | biostudies-literature
| S-EPMC3585308 | biostudies-literature
| S-EPMC5750088 | biostudies-literature
| S-EPMC8250434 | biostudies-literature
| S-EPMC10851780 | biostudies-literature
| S-EPMC4946647 | biostudies-literature
| S-EPMC8326439 | biostudies-literature
| S-EPMC5710405 | biostudies-literature