Unknown

Dataset Information

0

Methodological differences can affect sequencing depth with a possible impact on the accuracy of genetic diagnosis.


ABSTRACT: For a better interpretation of variants, evidence-based databases, such as ClinVar, compile data on the presumed relationships between variants and phenotypes. In this study, we aimed to analyze the pattern of sequencing depth in variants from whole-exome sequencing data in the 1000 Genomes project phase 3, focusing on the variants present in the ClinVar database that were predicted to affect protein-coding regions. We demonstrate that the distribution of the sequencing depth varies across different sequencing centers (pair-wise comparison, p < 0.001). Most importantly, we found that the distribution pattern of sequencing depth is specific to each facility, making it possible to correctly assign 96.9% of the samples to their sequencing center. Thus, indicating the presence of a systematic bias, related to the methods used in the different facilities, which generates significant variations in breadth and depth in whole-exome sequencing data in clinically relevant regions. Our results show that methodological differences, leading to significant heterogeneity in sequencing depth, may potentially influence the accuracy of genetic diagnosis. Furthermore, our findings highlight how it is still challenging to integrate results from different sequencing centers, which may also have an impact on genomic research.

SUBMITTER: Borges MG 

PROVIDER: S-EPMC7198014 | biostudies-literature | 2020 Jan-Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Methodological differences can affect sequencing depth with a possible impact on the accuracy of genetic diagnosis.

Borges Murilo G MG   Rocha Cristiane S CS   Carvalho Benilton S BS   Lopes-Cendes Iscia I  

Genetics and molecular biology 20200101 2


For a better interpretation of variants, evidence-based databases, such as ClinVar, compile data on the presumed relationships between variants and phenotypes. In this study, we aimed to analyze the pattern of sequencing depth in variants from whole-exome sequencing data in the 1000 Genomes project phase 3, focusing on the variants present in the ClinVar database that were predicted to affect protein-coding regions. We demonstrate that the distribution of the sequencing depth varies across diffe  ...[more]

Similar Datasets

| S-EPMC9067045 | biostudies-literature
| S-EPMC7331128 | biostudies-literature
| S-EPMC5550947 | biostudies-other
| S-EPMC4027199 | biostudies-literature
| S-EPMC4516391 | biostudies-literature
| S-EPMC4509609 | biostudies-literature
| S-EPMC9140952 | biostudies-literature
| S-EPMC7792814 | biostudies-literature
2013-07-12 | E-GEOD-46323 | biostudies-arrayexpress
| S-EPMC6954822 | biostudies-literature