Unknown

Dataset Information

0

Isma: an R package for the integrative analysis of mutations detected by multiple pipelines.


ABSTRACT: BACKGROUND:Recent comparative studies have brought to our attention how somatic mutation detection from next-generation sequencing data is still an open issue in bioinformatics, because different pipelines result in a low consensus. In this context, it is suggested to integrate results from multiple calling tools, but this operation is not trivial and the burden of merging, comparing, filtering and explaining the results demands appropriate software. RESULTS:We developed isma (integrative somatic mutation analysis), an R package for the integrative analysis of somatic mutations detected by multiple pipelines for matched tumor-normal samples. The package provides a series of functions to quantify the consensus, estimate the variability, underline outliers, integrate evidences from publicly available mutation catalogues and filter sites. We illustrate the capabilities of isma analysing breast cancer somatic mutations generated by The Cancer Genome Atlas (TCGA) using four pipelines. CONCLUSIONS:Comparing different "points of view" on the same data, isma generates a unique mutation catalogue and a series of reports that underline common patterns, variability, as well as sites already catalogued by other studies (e.g. TCGA), so as to design and apply filtering strategies to screen more reliable sites. The package is available for non-commercial users at the URL https://www.itb.cnr.it/isma .

SUBMITTER: Di Nanni N 

PROVIDER: S-EPMC6394085 | biostudies-other | 2019 Feb

REPOSITORIES: biostudies-other

altmetric image

Publications

isma: an R package for the integrative analysis of mutations detected by multiple pipelines.

Di Nanni Noemi N   Moscatelli Marco M   Gnocchi Matteo M   Milanesi Luciano L   Mosca Ettore E  

BMC bioinformatics 20190228 1


<h4>Background</h4>Recent comparative studies have brought to our attention how somatic mutation detection from next-generation sequencing data is still an open issue in bioinformatics, because different pipelines result in a low consensus. In this context, it is suggested to integrate results from multiple calling tools, but this operation is not trivial and the burden of merging, comparing, filtering and explaining the results demands appropriate software.<h4>Results</h4>We developed isma (int  ...[more]

Similar Datasets

| S-EPMC3810856 | biostudies-other
| S-EPMC7194084 | biostudies-literature
| S-EPMC4856967 | biostudies-literature
| S-EPMC9040017 | biostudies-literature
| S-EPMC4155609 | biostudies-literature
| S-EPMC8012210 | biostudies-literature
| S-EPMC10916305 | biostudies-literature
2020-12-05 | GSE162690 | GEO
| S-EPMC8666431 | biostudies-literature
| S-EPMC8565848 | biostudies-literature