Unknown

Dataset Information

0

Generating realistic null hypothesis of cancer mutational landscapes using SigProfilerSimulator


ABSTRACT: Background Performing a statistical test requires a null hypothesis. In cancer genomics, a key challenge is the fast generation of accurate somatic mutational landscapes that can be used as a realistic null hypothesis for making biological discoveries. Results Here we present SigProfilerSimulator, a powerful tool that is capable of simulating the mutational landscapes of thousands of cancer genomes at different resolutions within seconds. Applying SigProfilerSimulator to 2144 whole-genome sequenced cancers reveals: (i) that most doublet base substitutions are not due to two adjacent single base substitutions but likely occur as single genomic events; (ii) that an extended sequencing context of ±?2 bp is required to more completely capture the patterns of substitution mutational signatures in human cancer; (iii) information on false-positive discovery rate of commonly used bioinformatics tools for detecting driver genes. Conclusions SigProfilerSimulator’s breadth of features allows one to construct a tailored null hypothesis and use it for evaluating the accuracy of other bioinformatics tools or for downstream statistical analysis for biological discoveries. SigProfilerSimulator is freely available at https://github.com/AlexandrovLab/SigProfilerSimulator with an extensive documentation at https://osf.io/usxjz/wiki/home/.

SUBMITTER: Bergstrom E 

PROVIDER: S-EPMC7539472 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5147777 | biostudies-literature
| S-EPMC7805882 | biostudies-literature
| S-EPMC6225971 | biostudies-other
| S-EPMC9035066 | biostudies-literature
| S-EPMC5744871 | biostudies-literature
| S-EPMC11370959 | biostudies-literature
| S-EPMC10899878 | biostudies-literature
| S-EPMC9402440 | biostudies-literature
| S-EPMC5548793 | biostudies-literature
| S-EPMC4804330 | biostudies-literature