Dataset Information

MAD Bayes for Tumor Heterogeneity - Feature Allocation with Exponential Family Sampling.


ABSTRACT: We propose small-variance asymptotic approximations for inference on tumor heterogeneity (TH) using next-generation sequencing data. Understanding TH is an important and open research problem in biology. The lack of appropriate statistical inference is a critical gap in existing methods that the proposed approach aims to fill. We build on a hierarchical model with an exponential family likelihood and a feature allocation prior. The proposed implementation of posterior inference generalizes similar small-variance approximations proposed by Kulis and Jordan (2012) and Broderick et al. (2012b) for inference with Dirichlet process mixture and Indian buffet process prior models under normal sampling. We show that the new algorithm can successfully recover latent structures of different haplotypes and subclones and is orders of magnitude faster than available Markov chain Monte Carlo samplers. The latter are practically infeasible for high-dimensional genomics data. The proposed approach is scalable, easy to implement, and benefits from the flexibility of Bayesian nonparametric models. More importantly, it provides a useful tool for applied scientists to estimate cell subtypes in tumor samples. R code is available at http://www.ma.utexas.edu/users/yxu/.
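
For orientation, the small-variance idea the abstract builds on can be illustrated with DP-means, the hard-assignment algorithm that Kulis and Jordan (2012) obtain from a Dirichlet process mixture under normal sampling. The sketch below is not the method proposed in this paper, which uses a feature allocation prior and an exponential family likelihood; it only shows the simpler clustering case. The function name `dp_means`, the parameter `lam`, and the toy data are illustrative assumptions. The procedure locally minimizes the sum of squared distances to cluster centers plus `lam` times the number of clusters.

```python
def dp_means(points, lam, max_iter=100):
    """Illustrative DP-means sketch (hypothetical helper, not the paper's code).

    points: list of equal-length tuples of floats
    lam: penalty for opening a new cluster, compared against squared distance
    Returns (labels, centers).
    """
    n, dim = len(points), len(points[0])

    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # Start with a single cluster centered at the global mean.
    centers = [tuple(sum(p[d] for p in points) / n for d in range(dim))]
    labels = [0] * n

    for _ in range(max_iter):
        changed = False
        for i, p in enumerate(points):
            dists = [sqdist(p, c) for c in centers]
            k = min(range(len(centers)), key=dists.__getitem__)
            # Open a new cluster when even the nearest center is too far.
            if dists[k] > lam:
                centers.append(tuple(p))
                k = len(centers) - 1
            if labels[i] != k:
                labels[i] = k
                changed = True

        # Drop clusters that lost all members and relabel consecutively.
        used = sorted(set(labels))
        remap = {old: new for new, old in enumerate(used)}
        labels = [remap[l] for l in labels]

        # Recompute each center as the mean of its members.
        centers = []
        for k in range(len(used)):
            members = [p for p, l in zip(points, labels) if l == k]
            centers.append(
                tuple(sum(m[d] for m in members) / len(members) for d in range(dim))
            )

        if not changed:
            break
    return labels, centers


if __name__ == "__main__":
    # Toy data: two well-separated groups in the plane (made up for illustration).
    pts = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
    print(dp_means(pts, lam=4.0))
```

A larger `lam` makes new clusters more expensive and so yields fewer of them; the number of clusters is never fixed in advance, which is the property the small-variance limit inherits from the nonparametric prior.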

SUBMITTER: Xu Y 

PROVIDER: S-EPMC4498588 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

Publications

MAD Bayes for Tumor Heterogeneity - Feature Allocation with Exponential Family Sampling.

Yanxun Xu, Peter Müller, Yuan Yuan, Kamalakar Gulukota, Yuan Ji

Journal of the American Statistical Association, 2015-03-01, 510


Similar Datasets

| S-EPMC4498414 | biostudies-literature
| S-EPMC8281651 | biostudies-literature
| S-EPMC4758238 | biostudies-literature
| S-EPMC6853711 | biostudies-literature
| S-EPMC7472451 | biostudies-literature
2021-06-08 | GSE175769 | GEO
| S-EPMC4858653 | biostudies-other
| S-EPMC2654709 | biostudies-literature
| S-EPMC3005920 | biostudies-literature