
Dataset Information


Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2.


ABSTRACT: Copy Number Variants (CNVs) are structural rearrangements that contribute to phenotypic variation and have been shown to be associated with many disease states. In recent years, the identification of CNVs from whole-exome sequencing (WES) data has become common practice for research and clinical purposes and, consequently, the demand for increasingly efficient and accurate methods has grown. In this paper, we demonstrate that more than 30% of WES data map outside the targeted regions and that these reads, usually discarded, can be exploited to enhance the identification of CNVs from WES experiments. Here, we present EXCAVATOR2, the first read-count-based tool that exploits all the reads produced by WES experiments to detect CNVs with genome-wide resolution. To evaluate the performance of our novel tool, we use it to analyse two WES data sets: a population data set sequenced by the 1000 Genomes Project and a tumor data set made of bladder cancer samples. The results obtained from these analyses demonstrate that EXCAVATOR2 outperforms four other state-of-the-art methods and that our combined approach enlarges the spectrum of CNVs detectable from WES data with unprecedented resolution. EXCAVATOR2 is freely available at http://sourceforge.net/projects/excavator2tool/.
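The off-target fraction mentioned in the abstract can be illustrated with a minimal Python sketch. This is not EXCAVATOR2's code: the function name `off_target_fraction`, the input layout (read start positions as `(chrom, pos)` tuples, targets as sorted, non-overlapping half-open intervals per chromosome), and the toy coordinates are all assumptions made for illustration only.

```python
from bisect import bisect_right


def off_target_fraction(read_starts, targets):
    """Return the fraction of reads whose start lies outside every target interval.

    read_starts: iterable of (chrom, pos) tuples (hypothetical input layout).
    targets: dict mapping chrom -> list of (start, end) half-open intervals,
             assumed sorted by start and non-overlapping.
    """
    # Pre-compute the interval start coordinates per chromosome for binary search.
    starts = {chrom: [s for s, _ in ivs] for chrom, ivs in targets.items()}
    total = 0
    off = 0
    for chrom, pos in read_starts:
        total += 1
        ivs = targets.get(chrom, [])
        # Index of the last interval whose start is <= pos (or -1 if none).
        i = bisect_right(starts.get(chrom, []), pos) - 1
        if i < 0 or pos >= ivs[i][1]:
            off += 1
    return off / total if total else 0.0


# Toy data: two target regions on chr1, four reads (invented coordinates).
targets = {"chr1": [(100, 200), (500, 600)]}
reads = [("chr1", 150), ("chr1", 250), ("chr1", 550), ("chr2", 10)]
fraction = off_target_fraction(reads, targets)
```

In practice the read positions would come from an alignment file and the intervals from the capture kit's target file; the sketch only shows the counting step, not how the off-target reads are then used for CNV calling.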

SUBMITTER: D'Aurizio R 

PROVIDER: S-EPMC5175347 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature


Publications

Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2.

Romina D'Aurizio, Tommaso Pippucci, Lorenzo Tattini, Betti Giusti, Marco Pellegrini, Alberto Magi

Nucleic Acids Research, 2016 Aug 9, issue 20



Similar Datasets

| S-EPMC4053953 | biostudies-literature
| S-EPMC4081054 | biostudies-literature
| S-EPMC6126229 | biostudies-literature
| S-EPMC10762021 | biostudies-literature
| S-EPMC5452530 | biostudies-literature
| S-EPMC4849420 | biostudies-literature
| S-EPMC7604644 | biostudies-literature
| S-EPMC5494116 | biostudies-literature
| S-EPMC4587906 | biostudies-literature
| S-EPMC8699073 | biostudies-literature