Unknown

Dataset Information

0

CancerInSilico: An R/Bioconductor package for combining mathematical and statistical modeling to simulate time course bulk and single cell gene expression data in cancer.


ABSTRACT: Bioinformatics techniques to analyze time course bulk and single cell omics data are advancing. The absence of a known ground truth of the dynamics of molecular changes challenges benchmarking their performance on real data. Realistic simulated time-course datasets are essential to assess the performance of time course bioinformatics algorithms. We develop an R/Bioconductor package, CancerInSilico, to simulate bulk and single cell transcriptional data from a known ground truth obtained from mathematical models of cellular systems. This package contains a general R infrastructure for running cell-based models and simulating gene expression data based on the model states. We show how to use this package to simulate a gene expression data set and consequently benchmark analysis methods on this data set with a known ground truth. The package is freely available via Bioconductor: http://bioconductor.org/packages/CancerInSilico/.

SUBMITTER: Sherman TD 

PROVIDER: S-EPMC6504085 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

CancerInSilico: An R/Bioconductor package for combining mathematical and statistical modeling to simulate time course bulk and single cell gene expression data in cancer.

Sherman Thomas D TD   Kagohara Luciane T LT   Cao Raymon R   Cheng Raymond R   Satriano Matthew M   Considine Michael M   Krigsfeld Gabriel G   Ranaweera Ruchira R   Tang Yong Y   Jablonski Sandra A SA   Stein-O'Brien Genevieve G   Gaykalova Daria A DA   Weiner Louis M LM   Chung Christine H CH   Fertig Elana J EJ  

PLoS computational biology 20180601 4


Bioinformatics techniques to analyze time course bulk and single cell omics data are advancing. The absence of a known ground truth of the dynamics of molecular changes challenges benchmarking their performance on real data. Realistic simulated time-course datasets are essential to assess the performance of time course bioinformatics algorithms. We develop an R/Bioconductor package, CancerInSilico, to simulate bulk and single cell transcriptional data from a known ground truth obtained from math  ...[more]

Similar Datasets

2020-07-09 | GSE114375 | GEO
| PRJNA471189 | ENA
| S-EPMC7674939 | biostudies-literature
| S-EPMC4155246 | biostudies-literature
| S-EPMC3799478 | biostudies-literature
| S-EPMC8156495 | biostudies-literature
| S-EPMC4595899 | biostudies-other
2016-03-16 | E-GEOD-79208 | biostudies-arrayexpress
2016-03-16 | GSE79208 | GEO
| S-EPMC7583374 | biostudies-literature