Unknown

Dataset Information

0

A Deep Learning Approach for Detecting Copy Number Variation in Next-Generation Sequencing Data.


ABSTRACT: Copy number variants (CNV) are associated with phenotypic variation in several species. However, properly detecting changes in copy numbers of sequences remains a difficult problem, especially in lower quality or lower coverage next-generation sequencing data. Here, inspired by recent applications of machine learning in genomics, we describe a method to detect duplications and deletions in short-read sequencing data. In low coverage data, machine learning appears to be more powerful in the detection of CNVs than the gold-standard methods of coverage estimation alone, and of equal power in high coverage data. We also demonstrate how replicating training sets allows a more precise detection of CNVs, even identifying novel CNVs in two genomes previously surveyed thoroughly for CNVs using long read data.

SUBMITTER: Hill T 

PROVIDER: S-EPMC6829143 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Deep Learning Approach for Detecting Copy Number Variation in Next-Generation Sequencing Data.

Hill Tom T   Unckless Robert L RL  

G3 (Bethesda, Md.) 20191105 11


Copy number variants (CNV) are associated with phenotypic variation in several species. However, properly detecting changes in copy numbers of sequences remains a difficult problem, especially in lower quality or lower coverage next-generation sequencing data. Here, inspired by recent applications of machine learning in genomics, we describe a method to detect duplications and deletions in short-read sequencing data. In low coverage data, machine learning appears to be more powerful in the detec  ...[more]

Similar Datasets

| S-EPMC8406611 | biostudies-literature
| S-EPMC4021345 | biostudies-literature
| S-EPMC9887524 | biostudies-literature
| S-EPMC3317159 | biostudies-literature
| S-EPMC5427176 | biostudies-literature
| S-EPMC5655909 | biostudies-other
| S-EPMC3604020 | biostudies-literature
| S-EPMC7471150 | biostudies-literature
| S-EPMC4111851 | biostudies-literature
| S-EPMC5418319 | biostudies-literature