Unknown

Dataset Information

0

Tumor phylogeny inference using tree-constrained importance sampling.


ABSTRACT:

Motivation

A tumor arises from an evolutionary process that can be modeled as a phylogenetic tree. However, reconstructing this tree is challenging as most cancer sequencing uses bulk tumor tissue containing heterogeneous mixtures of cells.

Results

We introduce P robabilistic A lgorithm for S omatic Tr ee I nference (PASTRI), a new algorithm for bulk-tumor sequencing data that clusters somatic mutations into clones and infers a phylogenetic tree that describes the evolutionary history of the tumor. PASTRI uses an importance sampling algorithm that combines a probabilistic model of DNA sequencing data with a enumeration algorithm based on the combinatorial constraints defined by the underlying phylogenetic tree. As a result, tree inference is fast, accurate and robust to noise. We demonstrate on simulated data that PASTRI outperforms other cancer phylogeny algorithms in terms of runtime and accuracy. On real data from a chronic lymphocytic leukemia (CLL) patient, we show that a simple linear phylogeny better explains the data the complex branching phylogeny that was previously reported. PASTRI provides a robust approach for phylogenetic tree inference from mixed samples.

Availability and implementation

Software is available at compbio.cs.brown.edu/software.

Contact

braphael@princeton.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Satas G 

PROVIDER: S-EPMC5870673 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Tumor phylogeny inference using tree-constrained importance sampling.

Satas Gryte G   Raphael Benjamin J BJ  

Bioinformatics (Oxford, England) 20170701 14


<h4>Motivation</h4>A tumor arises from an evolutionary process that can be modeled as a phylogenetic tree. However, reconstructing this tree is challenging as most cancer sequencing uses bulk tumor tissue containing heterogeneous mixtures of cells.<h4>Results</h4>We introduce P robabilistic A lgorithm for S omatic Tr ee I nference (PASTRI), a new algorithm for bulk-tumor sequencing data that clusters somatic mutations into clones and infers a phylogenetic tree that describes the evolutionary his  ...[more]

Similar Datasets

| S-EPMC6927103 | biostudies-literature
| S-EPMC7451135 | biostudies-literature
| S-EPMC4211445 | biostudies-literature
| S-EPMC7197229 | biostudies-literature
| S-EPMC7582044 | biostudies-literature
| S-EPMC7160890 | biostudies-literature
| S-EPMC2823708 | biostudies-literature
| S-EPMC6612807 | biostudies-other
| S-EPMC6551234 | biostudies-literature
| S-EPMC7020887 | biostudies-literature