Unknown

Dataset Information

0

Snowball: resampling combined with distance-based regression to discover transcriptional consequences of a driver mutation.


ABSTRACT:

Motivation

Large-scale cancer genomic studies, such as The Cancer Genome Atlas (TCGA), have profiled multidimensional genomic data, including mutation and expression profiles on a variety of cancer cell types, to uncover the molecular mechanism of cancerogenesis. More than a hundred driver mutations have been characterized that confer the advantage of cell growth. However, how driver mutations regulate the transcriptome to affect cellular functions remains largely unexplored. Differential analysis of gene expression relative to a driver mutation on patient samples could provide us with new insights in understanding driver mutation dysregulation in tumor genome and developing personalized treatment strategies.

Results

Here, we introduce the Snowball approach as a highly sensitive statistical analysis method to identify transcriptional signatures that are affected by a recurrent driver mutation. Snowball utilizes a resampling-based approach and combines a distance-based regression framework to assign a robust ranking index of genes based on their aggregated association with the presence of the mutation, and further selects the top significant genes for downstream data analyses or experiments. In our application of the Snowball approach to both synthesized and TCGA data, we demonstrated that it outperforms the standard methods and provides more accurate inferences to the functional effects and transcriptional dysregulation of driver mutations.

Availability and implementation

R package and source code are available from CRAN at http://cran.r-project.org/web/packages/DESnowball, and also available at http://bioinfo.mc.vanderbilt.edu/DESnowball/.

SUBMITTER: Xu Y 

PROVIDER: S-EPMC4271146 | biostudies-literature | 2015 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Snowball: resampling combined with distance-based regression to discover transcriptional consequences of a driver mutation.

Xu Yaomin Y   Guo Xingyi X   Sun Jiayang J   Zhao Zhongming Z  

Bioinformatics (Oxford, England) 20140905 1


<h4>Motivation</h4>Large-scale cancer genomic studies, such as The Cancer Genome Atlas (TCGA), have profiled multidimensional genomic data, including mutation and expression profiles on a variety of cancer cell types, to uncover the molecular mechanism of cancerogenesis. More than a hundred driver mutations have been characterized that confer the advantage of cell growth. However, how driver mutations regulate the transcriptome to affect cellular functions remains largely unexplored. Differentia  ...[more]

Similar Datasets

| S-EPMC6555848 | biostudies-literature
| S-EPMC4987284 | biostudies-literature
| S-EPMC6941958 | biostudies-literature
| S-EPMC26445 | biostudies-literature
| S-EPMC6928252 | biostudies-literature
| S-EPMC5683086 | biostudies-literature
2023-02-08 | GSE222348 | GEO
| S-EPMC6895256 | biostudies-literature
| S-EPMC8265092 | biostudies-literature
| S-EPMC3411958 | biostudies-literature