Unknown

Dataset Information

0

Scalable inference of transcriptional kinetic parameters from MS2 time series data.


ABSTRACT:

Motivation

The MS2-MCP (MS2 coat protein) live imaging system allows for visualisation of transcription dynamics through the introduction of hairpin stem-loop sequences into a gene. A fluorescent signal at the site of nascent transcription in the nucleus quantifies mRNA production. Computational modelling can be used to infer the promoter states along with the kinetic parameters governing transcription, such as promoter switching frequency and polymerase loading rate. However, modelling of the fluorescent trace presents a challenge due its persistence; the observed fluorescence at a given time point depends on both current and previous promoter states. A compound state Hidden Markov Model (cpHMM) was recently introduced to allow inference of promoter activity from MS2-MCP data. However, the computational time for inference scales exponentially with gene length and the cpHMM is therefore not currently practical for application to many eukaryotic genes.

Results

We present a scalable implementation of the cpHMM for fast inference of promoter activity and transcriptional kinetic parameters. This new method can model genes of arbitrary length through the use of a time-adaptive truncated compound state space. The truncated state space provides a good approximation to the full state space by retaining the most likely set of states at each time during the forward pass of the algorithm. Testing on MS2-MCP fluorescent data collected from early Drosophila melanogaster embryos indicates that the method provides accurate inference of kinetic parameters within a computationally feasible timeframe. The inferred promoter traces generated by the model can also be used to infer single-cell transcriptional parameters.

Availability

Python implementation available at https://github.com/ManchesterBioinference/burstInfer, along with code to reproduce the examples presented here.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Bowles JR 

PROVIDER: S-EPMC8796374 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC10237648 | biostudies-literature
| S-EPMC3114728 | biostudies-literature
| S-EPMC6698887 | biostudies-literature
| S-EPMC7875426 | biostudies-literature
2021-02-22 | GSE144663 | GEO
| S-EPMC5860468 | biostudies-other
| S-EPMC7505465 | biostudies-literature
| S-EPMC6555871 | biostudies-literature
2021-02-22 | GSE144662 | GEO
2021-02-22 | GSE144660 | GEO