Dataset Information

Automated deconvolution of structured mixtures from heterogeneous tumor genomic data.

ABSTRACT: With increasing appreciation for the extent and importance of intratumor heterogeneity, much attention in cancer research has focused on profiling heterogeneity on a single patient level. Although true single-cell genomic technologies are rapidly improving, they remain too noisy and costly at present for population-level studies. Bulk sequencing remains the standard for population-scale tumor genomics, creating a need for computational tools to separate contributions of multiple tumor clones and assorted stromal and infiltrating cell populations to pooled genomic data. All such methods are limited to coarse approximations of only a few cell subpopulations, however. In prior work, we demonstrated the feasibility of improving cell type deconvolution by taking advantage of substructure in genomic mixtures via a strategy called simplicial complex unmixing. We improve on past work by introducing enhancements to automate learning of substructured genomic mixtures, with specific emphasis on genome-wide copy number variation (CNV) data, as well as the ability to process quantitative RNA expression data, and heterogeneous combinations of RNA and CNV data. We introduce methods for dimensionality estimation to better decompose mixture model substructure; fuzzy clustering to better identify substructure in sparse, noisy data; and automated model inference methods for other key model parameters. We further demonstrate their effectiveness in identifying mixture substructure in true breast cancer CNV data from the Cancer Genome Atlas (TCGA). Source code is available at https://github.com/tedroman/WSCUnmix.

SUBMITTER: Roman T

PROVIDER: S-EPMC5695636 | biostudies-literature | 2017 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Automated deconvolution of structured mixtures from heterogeneous tumor genomic data.

Roman Theodore T Xie Lu L Schwartz Russell R

PLoS computational biology 20171023 10

With increasing appreciation for the extent and importance of intratumor heterogeneity, much attention in cancer research has focused on profiling heterogeneity on a single patient level. Although true single-cell genomic technologies are rapidly improving, they remain too noisy and costly at present for population-level studies. Bulk sequencing remains the standard for population-scale tumor genomics, creating a need for computational tools to separate contributions of multiple tumor clones and ...[more]

PMID: 29059177

Dataset Information

Automated deconvolution of structured mixtures from heterogeneous tumor genomic data.

Publications

Automated deconvolution of structured mixtures from heterogeneous tumor genomic data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Deconvolution of heterogeneous tumor samples using partial reference signals.
| S-EPMC7728196 | biostudies-literature

Transcriptome Deconvolution of Heterogeneous Tumor Samples with Immune Infiltration.
| S-EPMC6249353 | biostudies-literature

AI-Based Automated Lipomatous Tumor Segmentation in MR Images: Ensemble Solution to Heterogeneous Data.
| S-EPMC10287587 | biostudies-literature

Phenotypic deconvolution in heterogeneous cancer cell populations using drug-screening data.
| S-EPMC10088094 | biostudies-literature

CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data.
| S-EPMC6907860 | biostudies-literature

Deconvolution of tumor composition using partially available DNA methylation data.
| S-EPMC9400327 | biostudies-literature

A structured multi-head attention prediction method based on heterogeneous financial data.
| S-EPMC10703059 | biostudies-literature

Structured Matrix Completion with Applications to Genomic Data Integration.
| S-EPMC5198844 | biostudies-literature

A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data.
| S-EPMC2965384 | biostudies-literature