Dataset Information

Learning smoothing models of copy number profiles using breakpoint annotations.


ABSTRACT:

Background

Many models have been proposed to detect copy number alterations in chromosomal copy number profiles, but it is usually not obvious which is most effective for a given data set. Furthermore, most methods have a smoothing parameter that determines the number of breakpoints and must be chosen using various heuristics.

Results

We present three contributions for copy number profile smoothing model selection. First, we propose to select the model and degree of smoothness that maximizes agreement with visual breakpoint region annotations. Second, we develop cross-validation procedures to estimate the error of the trained models. Third, we apply these methods to compare 17 smoothing models on a new database of 575 annotated neuroblastoma copy number profiles, which we make available as a public benchmark for testing new algorithms.
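The first contribution, choosing the degree of smoothness that best agrees with annotated regions, can be illustrated with a minimal sketch. It is not the authors' implementation. The annotation format (a `(start, end, label)` tuple with labels `"breakpoint"` and `"normal"`), the function names `annotation_errors` and `select_smoothing`, and the tie-breaking rule (prefer the smaller parameter value) are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical annotation format: (start, end, label), where label is
# "breakpoint" (at least one change expected in the region) or
# "normal" (no change expected in the region).
Annotation = Tuple[int, int, str]


def annotation_errors(breakpoints: List[int], annotations: List[Annotation]) -> int:
    """Count annotated regions that the predicted breakpoints disagree with."""
    errors = 0
    for start, end, label in annotations:
        n_inside = sum(start <= b <= end for b in breakpoints)
        if label == "breakpoint" and n_inside == 0:
            errors += 1  # false negative: an annotated change was missed
        elif label == "normal" and n_inside > 0:
            errors += 1  # false positive: a change in a region marked flat
    return errors


def select_smoothing(candidates: Dict[float, List[int]],
                     annotations: List[Annotation]) -> float:
    """Return the candidate parameter with the fewest annotation errors.

    Ties are broken by taking the smaller parameter value (an arbitrary
    choice made for this sketch).
    """
    return min(
        candidates,
        key=lambda p: (annotation_errors(candidates[p], annotations), p),
    )


# Toy example: predicted breakpoint positions for three smoothing levels.
annotations = [(100, 200, "breakpoint"), (300, 400, "normal")]
candidates = {
    0.1: [150, 350, 500],  # under-smoothed: spurious change in the normal region
    1.0: [150, 500],       # agrees with both annotations
    10.0: [],              # over-smoothed: misses the annotated change
}
best = select_smoothing(candidates, annotations)
```

In this toy example `best` is `1.0`, the only candidate with no annotation errors. Estimating how well the selected parameter generalizes would additionally require holding out some annotations, as in the cross-validation procedures the abstract mentions.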

Conclusions

Whereas previous studies have been qualitative or limited to simulated data, our annotation-guided approach is quantitative and suggests which algorithms are fastest and most accurate in practice on real data. In the neuroblastoma data, the equivalent pelt.n and cghseg.k methods were the best breakpoint detectors, and exhibited reasonable computation times.

SUBMITTER: Hocking TD 

PROVIDER: S-EPMC3712326 | biostudies-literature | 2013 May

REPOSITORIES: biostudies-literature

Publications

Learning smoothing models of copy number profiles using breakpoint annotations.

Hocking, Toby Dylan; Schleiermacher, Gudrun; Janoueix-Lerosey, Isabelle; Boeva, Valentina; Cappo, Julie; Delattre, Olivier; Bach, Francis; Vert, Jean-Philippe

BMC Bioinformatics, 2013-05-22


Similar Datasets

| S-EPMC2577857 | biostudies-literature
| S-EPMC9272807 | biostudies-literature
| S-EPMC5062840 | biostudies-other
| S-EPMC8504801 | biostudies-literature
| S-EPMC7160889 | biostudies-literature
| S-EPMC4330915 | biostudies-literature
| S-EPMC5285462 | biostudies-literature
| S-EPMC3411951 | biostudies-literature
| S-EPMC4866742 | biostudies-literature