
Dataset Information


Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood.


ABSTRACT: Multiple mutational processes drive carcinogenesis, leaving characteristic signatures in tumor genomes. Determining the active signatures from a full repertoire of potential ones helps elucidate mechanisms of cancer development. This involves optimally decomposing the counts of cancer mutations, tabulated according to their trinucleotide context, into a linear combination of known signatures. Here, we develop sigLASSO (a software tool at github.com/gersteinlab/siglasso) to carry out this optimization efficiently. sigLASSO has four key aspects: (1) It jointly optimizes the likelihood of sampling and signature fitting, by explicitly factoring multinomial sampling into the objective function. This is particularly important when mutation counts are low and sampling variance is high (e.g., in exome sequencing). (2) sigLASSO uses L1 regularization to parsimoniously assign signatures, leading to sparse and interpretable solutions. (3) It fine-tunes model complexity, informed by data scale and biological priors. (4) Consequently, sigLASSO can assess model uncertainty and abstain from making assignments in low-confidence contexts.
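The decomposition described in the abstract can be illustrated with a toy example. The following is a minimal sketch, not sigLASSO's actual algorithm or API: it fits nonnegative signature weights by L1-penalized least squares on mutation proportions (using proximal gradient descent), then evaluates a multinomial log-likelihood separately rather than optimizing it jointly as the paper describes. The function names, the 6-context toy signature matrix, the counts, and the penalty value `lam` are all hypothetical and chosen only for illustration.

```python
import numpy as np


def fit_signatures_l1(counts, signatures, lam=0.001, n_iter=5000):
    """Nonnegative L1-penalized least-squares fit (illustrative only).

    counts     : (C,) mutation counts per context
    signatures : (C, K) matrix, each column a signature summing to 1
    Returns a (K,) vector of nonnegative weights.
    """
    counts = np.asarray(counts, dtype=float)
    S = np.asarray(signatures, dtype=float)
    p_obs = counts / counts.sum()  # observed mutation proportions
    # Step size from the Lipschitz constant of the least-squares gradient.
    step = 1.0 / np.linalg.norm(S, 2) ** 2
    w = np.zeros(S.shape[1])
    for _ in range(n_iter):
        grad = S.T @ (S @ w - p_obs)
        # Proximal step for lam * sum(w) with w >= 0: shift, then clip at 0.
        w = np.maximum(w - step * (grad + lam), 0.0)
    return w


def multinomial_loglik(counts, signatures, w, eps=1e-12):
    """Multinomial log-likelihood of the counts, up to an additive constant."""
    counts = np.asarray(counts, dtype=float)
    p = np.asarray(signatures, dtype=float) @ np.asarray(w, dtype=float)
    p = p / p.sum()  # normalize to a probability vector
    return float(np.sum(counts * np.log(p + eps)))


# Hypothetical toy data: 6 mutation contexts, 3 candidate signatures.
toy_signatures = np.array([
    [0.6, 0.0, 0.1],
    [0.2, 0.1, 0.1],
    [0.1, 0.1, 0.3],
    [0.1, 0.2, 0.3],
    [0.0, 0.3, 0.1],
    [0.0, 0.3, 0.1],
])
# Counts generated from weights (0.7, 0.3, 0.0) with 1000 mutations, no noise.
toy_counts = np.array([420, 170, 100, 130, 90, 90])

toy_weights = fit_signatures_l1(toy_counts, toy_signatures)
toy_loglik = multinomial_loglik(toy_counts, toy_signatures, toy_weights)
```

In this sketch the L1 penalty drives the weight of the unused third signature to zero, which is the kind of sparse assignment the abstract refers to. With low mutation counts the observed proportions become noisy, which is the situation in which the abstract argues for including the sampling likelihood in the objective itself.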

SUBMITTER: Li S 

PROVIDER: S-EPMC7368050 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature


Publications

Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood.

Shantao Li, Forrest W. Crawford, Mark B. Gerstein

Nature Communications, 2020 Jul 17, issue 1



Similar Datasets

| S-EPMC2877550 | biostudies-literature
| S-EPMC3929280 | biostudies-literature
| S-EPMC6402697 | biostudies-literature
| S-EPMC7195458 | biostudies-literature
| S-EPMC4701410 | biostudies-literature
| S-EPMC7758077 | biostudies-literature
| S-EPMC5993214 | biostudies-literature
| S-EPMC7403618 | biostudies-literature
| S-EPMC3954410 | biostudies-literature
| S-EPMC7577508 | biostudies-literature