Unknown

Dataset Information

0

MetaCNV - a consensus approach to infer accurate copy numbers from low coverage data.


ABSTRACT: BACKGROUND:The majority of copy number callers requires high read coverage data that is often achieved with elevated material input, which increases the heterogeneity of tissue samples. However, to gain insights into smaller areas within a tissue sample, e.g. a cancerous area in a heterogeneous tissue sample, less material is used for sequencing, which results in lower read coverage. Therefore, more focus needs to be put on copy number calling that is sensitive enough for low coverage data. RESULTS:We present MetaCNV, a copy number caller that infers reliable copy numbers for human genomes with a consensus approach. MetaCNV specializes in low coverage data, but also performs well on normal and high coverage data. MetaCNV integrates the results of multiple copy number callers and infers absolute and unbiased copy numbers for the entire genome. MetaCNV is based on a meta-model that bypasses the weaknesses of current calling models while combining the strengths of existing approaches. Here we apply MetaCNV based on ReadDepth, SVDetect, and CNVnator to real and simulated datasets in order to demonstrate how the approach improves copy number calling. CONCLUSIONS:MetaCNV, available at https://bitbucket.org/sonnhammergroup/metacnv, provides accurate copy number prediction on low coverage data and performs well on high coverage data.

SUBMITTER: Friedrich S 

PROVIDER: S-EPMC7268502 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

MetaCNV - a consensus approach to infer accurate copy numbers from low coverage data.

Friedrich Stefanie S   Barbulescu Remus R   Helleday Thomas T   Sonnhammer Erik L L ELL  

BMC medical genomics 20200601 1


<h4>Background</h4>The majority of copy number callers requires high read coverage data that is often achieved with elevated material input, which increases the heterogeneity of tissue samples. However, to gain insights into smaller areas within a tissue sample, e.g. a cancerous area in a heterogeneous tissue sample, less material is used for sequencing, which results in lower read coverage. Therefore, more focus needs to be put on copy number calling that is sensitive enough for low coverage da  ...[more]

Similar Datasets

| S-EPMC8236193 | biostudies-literature
| S-EPMC7205152 | biostudies-literature
| S-EPMC3511991 | biostudies-literature
| S-EPMC4341071 | biostudies-literature
| S-EPMC3679970 | biostudies-literature
2015-10-01 | GSE73191 | GEO
| S-EPMC5737671 | biostudies-literature
| S-EPMC2752127 | biostudies-literature
2021-07-19 | GSE165336 | GEO