Unknown

Dataset Information

0

PatternMarkers & GWCoGAPS for novel data-driven biomarkers via whole transcriptome NMF.


ABSTRACT: Non-negative Matrix Factorization (NMF) algorithms associate gene expression with biological processes (e.g. time-course dynamics or disease subtypes). Compared with univariate associations, the relative weights of NMF solutions can obscure biomarkers. Therefore, we developed a novel patternMarkers statistic to extract genes for biological validation and enhanced visualization of NMF results. Finding novel and unbiased gene markers with patternMarkers requires whole-genome data. Therefore, we also developed Genome-Wide CoGAPS Analysis in Parallel Sets (GWCoGAPS), the first robust whole genome Bayesian NMF using the sparse, MCMC algorithm, CoGAPS. Additionally, a manual version of the GWCoGAPS algorithm contains analytic and visualization tools including patternMatcher, a Shiny web application. The decomposition in the manual pipeline can be replaced with any NMF algorithm, for further generalization of the software. Using these tools, we find granular brain-region and cell-type specific signatures with corresponding biomarkers in GTEx data, illustrating GWCoGAPS and patternMarkers ascertainment of data-driven biomarkers from whole-genome data.PatternMarkers & GWCoGAPS are in the CoGAPS Bioconductor package (3.5) under the GPL license.gsteinobrien@jhmi.edu or ccolantu@jhmi.edu or ejfertig@jhmi.edu.Supplementary data are available at Bioinformatics online.

SUBMITTER: Stein-O'Brien GL 

PROVIDER: S-EPMC5860188 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications


<h4>Summary</h4>Non-negative Matrix Factorization (NMF) algorithms associate gene expression with biological processes (e.g. time-course dynamics or disease subtypes). Compared with univariate associations, the relative weights of NMF solutions can obscure biomarkers. Therefore, we developed a novel patternMarkers statistic to extract genes for biological validation and enhanced visualization of NMF results. Finding novel and unbiased gene markers with patternMarkers requires whole-genome data.  ...[more]

Similar Datasets

| S-EPMC7647097 | biostudies-literature
| S-EPMC6305402 | biostudies-literature
| S-EPMC10883375 | biostudies-literature
| S-EPMC6394389 | biostudies-literature
| S-EPMC5724404 | biostudies-literature
| S-EPMC11336660 | biostudies-literature
| S-EPMC3142167 | biostudies-literature
| S-EPMC10543961 | biostudies-literature
| S-EPMC7571410 | biostudies-literature
| S-EPMC8510447 | biostudies-literature