Unknown

Dataset Information

0

REMap: Operon map of M. tuberculosis based on RNA sequence data.


ABSTRACT: A map of the transcriptional organization of genes of an organism is a basic tool that is necessary to understand and facilitate a more accurate genetic manipulation of the organism. Operon maps are largely generated by computational prediction programs that rely on gene conservation and genome architecture and may not be physiologically relevant. With the widespread use of RNA sequencing (RNAseq), the prediction of operons based on actual transcriptome sequencing rather than computational genomics alone is much needed. Here, we report a validated operon map of Mycobacterium tuberculosis, developed using RNAseq data from both the exponential and stationary phases of growth. At least 58.4% of M. tuberculosis genes are organized into 749 operons. Our prediction algorithm, REMap (RNA Expression Mapping of operons), considers the many cases of transcription coverage of intergenic regions, and avoids dependencies on functional annotation and arbitrary assumptions about gene structure. As a result, we demonstrate that REMap is able to more accurately predict operons, especially those that contain long intergenic regions or functionally unrelated genes, than previous operon prediction programs. The REMap algorithm is publicly available as a user-friendly tool that can be readily modified to predict operons in other bacteria.

SUBMITTER: Pelly S 

PROVIDER: S-EPMC4967370 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

REMap: Operon map of M. tuberculosis based on RNA sequence data.

Pelly Shaaretha S   Winglee Kathryn K   Xia Fang Fang FF   Stevens Rick L RL   Bishai William R WR   Lamichhane Gyanu G  

Tuberculosis (Edinburgh, Scotland) 20160429


A map of the transcriptional organization of genes of an organism is a basic tool that is necessary to understand and facilitate a more accurate genetic manipulation of the organism. Operon maps are largely generated by computational prediction programs that rely on gene conservation and genome architecture and may not be physiologically relevant. With the widespread use of RNA sequencing (RNAseq), the prediction of operons based on actual transcriptome sequencing rather than computational genom  ...[more]

Similar Datasets

| S-EPMC1976454 | biostudies-literature
| S-EPMC3629779 | biostudies-literature
| S-EPMC7595951 | biostudies-literature
| S-EPMC1780048 | biostudies-literature
| S-EPMC1283447 | biostudies-literature
| S-EPMC3207917 | biostudies-literature
| S-EPMC6092779 | biostudies-literature
| S-EPMC5753293 | biostudies-literature
| S-EPMC3348046 | biostudies-literature
| S-EPMC8004444 | biostudies-literature