
Dataset Information


SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.


ABSTRACT:

Motivation

Whole-genome sequencing with paired-end reads can be used to characterize the landscape of large somatic rearrangements in cancer genomes. Several methods for detecting structural variants from whole-genome sequencing data have been developed. So far, however, none of them combines information about abnormally mapped read pairs connecting rearranged regions with the associated global copy number changes inferred automatically from the same sequencing data file. Our aim was to create a computational method that uses both types of information, i.e. normal and abnormal reads, and to demonstrate that doing so substantially improves both the sensitivity and the specificity of structural variant prediction.

Results

We developed SV-Bay, a computational method that detects structural variants from whole-genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. The approach takes into account both the depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likelihood, SV-Bay considers the GC-content and read mappability of the genome, and corrects the expected read count accordingly. To detect somatic variants, SV-Bay uses a matched normal sample when one is available. We validated SV-Bay on simulated datasets and on an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. In a comparison with several other methods for structural variant detection, SV-Bay showed better prediction accuracy in terms of both sensitivity and false-positive detection rate.
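
As a rough illustration of the idea of correcting an expected read count for GC-content and mappability and then weighing the evidence in a Bayesian way, the following Python sketch may help. It is not SV-Bay's actual model or code: the multiplicative correction, the Poisson likelihood, the prior value, and all function and parameter names (`expected_read_count`, `gc_factor`, `mappability`, `log_posterior_odds`) are assumptions made purely for illustration.

```python
import math


def expected_read_count(base_rate, window_len, gc_factor, mappability,
                        copy_number, ploidy=2):
    """Expected reads in a window (hypothetical multiplicative model).

    base_rate   : reads per base at normal ploidy (assumed known)
    gc_factor   : relative coverage for the window's GC-content (assumed)
    mappability : fraction of uniquely mappable positions in the window
    """
    return (base_rate * window_len * gc_factor * mappability
            * copy_number / ploidy)


def poisson_log_pmf(k, lam):
    """Log-probability of observing k reads under Poisson(lam)."""
    if lam <= 0:
        return 0.0 if k == 0 else float("-inf")
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def log_posterior_odds(observed, lam_variant, lam_null, prior_variant=0.01):
    """Log posterior odds of 'variant' versus 'no variant'.

    Combines a Poisson log-likelihood ratio with an assumed prior
    probability of a variant (prior_variant is illustrative only).
    """
    log_lr = (poisson_log_pmf(observed, lam_variant)
              - poisson_log_pmf(observed, lam_null))
    log_prior_odds = math.log(prior_variant) - math.log(1 - prior_variant)
    return log_lr + log_prior_odds


if __name__ == "__main__":
    # A low-mappability, GC-poor window is expected to hold fewer reads.
    lam = expected_read_count(0.01, 1000, gc_factor=0.8,
                              mappability=0.5, copy_number=2)
    print(f"corrected expected count: {lam:.2f}")
    # Positive log-odds favour the variant hypothesis.
    print(log_posterior_odds(observed=10, lam_variant=10.0, lam_null=1.0))
```

In this sketch, ignoring the correction factors would overestimate the expected count in GC-poor or poorly mappable windows, which in turn would make a normal region look like a copy number loss.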

Availability and implementation

https://github.com/InstitutCurie/SV-Bay

Contact

valentina.boeva@inserm.fr

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Iakovishina D 

PROVIDER: S-EPMC4896370 | biostudies-literature | 2016 Apr

REPOSITORIES: biostudies-literature


Publications

SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.

Daria Iakovishina, Isabelle Janoueix-Lerosey, Emmanuel Barillot, Mireille Regnier, Valentina Boeva

Bioinformatics (Oxford, England), 2016-01-06, issue 7



Similar Datasets

| S-EPMC8153448 | biostudies-literature
| S-EPMC3274465 | biostudies-literature
2015-01-23 | GSE56639 | GEO
2015-01-23 | E-GEOD-56639 | biostudies-arrayexpress
| S-EPMC3720884 | biostudies-literature
| S-EPMC4450053 | biostudies-literature
| S-EPMC4349097 | biostudies-literature
| S-EPMC3799449 | biostudies-literature
| S-EPMC6080486 | biostudies-literature
| S-EPMC6867656 | biostudies-literature