Dataset Information

Evaluation of computational genotyping of structural variation for clinical diagnoses.

ABSTRACT: Structural variation (SV) plays a pivotal role in genetic disease. The discovery of SVs based on short DNA sequence reads from next-generation DNA sequence methods is error-prone, with low sensitivity and high false discovery rates. These shortcomings can be partially overcome with extensive orthogonal validation methods or use of long reads, but the current cost precludes their application for routine clinical diagnostics. In contrast, SV genotyping of known sites of SV occurrence is relatively robust and therefore offers a cost-effective clinical diagnostic tool with potentially few false-positive and false-negative results, even when applied to short-read DNA sequence data. We assess 5 state-of-the-art SV genotyping software methods, applied to short-read sequence data. The methods are characterized on the basis of their ability to genotype different SV types, spanning different size ranges. Furthermore, we analyze their ability to parse different VCF file subformats and assess their reliance on specific metadata. We compare the SV genotyping methods across a range of simulated and real data including SVs that were not found with Illumina data alone. We assess sensitivity and the ability to filter initial false discovery calls. We determined the impact of SV type and size on the performance for each SV genotyper. Overall, STIX performed the best on both simulated and GiaB based SV calls, demonstrating a good balance between sensitivity and specificty. Our results indicate that, although SV genotyping software methods have superior performance to SV callers, there are limitations that suggest the need for further innovation.

SUBMITTER: Chander V

PROVIDER: S-EPMC6732172 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Evaluation of computational genotyping of structural variation for clinical diagnoses.

Chander Varuna V Gibbs Richard A RA Sedlazeck Fritz J FJ

GigaScience 20190901 9

<h4>Background</h4>Structural variation (SV) plays a pivotal role in genetic disease. The discovery of SVs based on short DNA sequence reads from next-generation DNA sequence methods is error-prone, with low sensitivity and high false discovery rates. These shortcomings can be partially overcome with extensive orthogonal validation methods or use of long reads, but the current cost precludes their application for routine clinical diagnostics. In contrast, SV genotyping of known sites of SV occur ...[more]

PMID: 31494671

Dataset Information

Evaluation of computational genotyping of structural variation for clinical diagnoses.

Publications

Evaluation of computational genotyping of structural variation for clinical diagnoses.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs.
| S-EPMC6881350 | biostudies-literature

Protein structural variation in computational models and crystallographic data.
| S-EPMC2350181 | biostudies-literature

Clinical and Structural Differences in Delusions Across Diagnoses: A Systematic Review.
| S-EPMC8818879 | biostudies-literature

Discovery and genotyping of structural variation from long-read haploid genome sequence data.
| S-EPMC5411763 | biostudies-literature

SV2: accurate structural variation genotyping and de novo mutation detection from whole genomes.
| S-EPMC5946924 | biostudies-literature

RBP Footprint Grand Challenge: An evaluation of novel computational approaches to RNA-binding protein target prediction from structural data
2024-02-20 | GSE227455 | GEO

Copy number variation genotyping using family information.
| S-EPMC3668900 | biostudies-other

Analytical Validation of a Computational Method for Pharmacogenetic Genotyping from Clinical Whole Exome Sequencing.
| S-EPMC9227988 | biostudies-literature

Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data.
| S-EPMC9034514 | biostudies-literature

Decoding noises in HIV computational genotyping.
| S-EPMC6443101 | biostudies-literature