Genomics

Dataset Information

0

Allele Balance Bias Identifies Systematic Genotyping Errors and False Disease Associations


ABSTRACT: In recent years, Next Generation Sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although random sequencing errors can be modeled statistically and deep sequencing minimizes their impact, systematic errors remain a problem even at high depth of coverage. Understanding their source is crucial to increase precision of clinical NGS applications. In this work, we studied the relation between recurrent biases in allele balance (AB), systematic errors and false positive variant calls across a large cohort of human samples analyzed by whole exome sequencing (WES). We have modeled the allele balance distribution for biallelic genotypes in 987 WES samples in order to identify positions recurrently deviating significantly from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we have developed a genotype callability score based on ABB for all positions of the human exome, which detects false positive variant calls that passed state-of-the-art filters. Finally, we demonstrate the use of ABB for detection of false associations proposed by rare variant association studies (RVAS).

PROVIDER: EGAS00001003027 | EGA |

REPOSITORIES: EGA

Similar Datasets

2012-03-03 | GSE36217 | GEO
2012-03-03 | E-GEOD-36217 | biostudies-arrayexpress
2022-09-09 | MTBLS833 | MetaboLights
2022-09-09 | MTBLS834 | MetaboLights
2018-12-15 | GSE85806 | GEO
| phs001021 | dbGaP
| PRJEB55823 | ENA
2021-07-23 | GSE155099 | GEO
2019-06-19 | PXD012628 | Pride
| PRJNA693080 | ENA