Bioinformatics techniques to analyze time course bulk and single cell omics data are advancing. The absence of a known ground truth of the dynamics of molecular changes challenges benchmarking their performance on real data. Realistic simulated time-course datasets are essential to assess the performance of time course bioinformatics algorithms. We develop an R/Bioconductor package, CancerInSilico, to simulate bulk and single cell transcriptional data from a known ground truth obtained from mathematical models of cellular systems. This package contains a general R infrastructure for running cell-based models and simulating gene expression data based on the model states. We show how to use this package to simulate a gene expression data set and consequently benchmark analysis methods on this data set with a known ground truth. The package is freely available via Bioconductor: http://bioconductor.org/packages/CancerInSilico/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6504085PMC
http://dx.doi.org/10.1371/journal.pcbi.1006935DOI Listing

Publication Analysis

Top Keywords

time course
12
bulk single
12
single cell
12
gene expression
12
expression data
12
ground truth
12
r/bioconductor package
8
course bulk
8
data set
8
data
7

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!