Unknown

Dataset Information

0

G4PromFinder: an algorithm for predicting transcription promoters in GC-rich bacterial genomes based on AT-rich elements and G-quadruplex motifs.


ABSTRACT: BACKGROUND:Over the last few decades, computational genomics has tremendously contributed to decipher biology from genome sequences and related data. Considerable effort has been devoted to the prediction of transcription promoter and terminator sites that represent the essential "punctuation marks" for DNA transcription. Computational prediction of promoters in prokaryotes is a problem whose solution is far from being determined in computational genomics. The majority of published bacterial promoter prediction tools are based on a consensus-sequences search and they were designed specifically for vegetative ?70 promoters and, therefore, not suitable for promoter prediction in bacteria encoding a lot of ? factors, like actinomycetes. RESULTS:In this study we investigated the possibility to identify putative promoters in prokaryotes based on evolutionarily conserved motifs, and focused our attention on GC-rich bacteria in which promoter prediction with conventional, consensus-based algorithms is often not-exhaustive. Here, we introduce G4PromFinder, a novel algorithm that predicts putative promoters based on AT-rich elements and G-quadruplex DNA motifs. We tested its performances by using available genomic and transcriptomic data of the model microorganisms Streptomyces coelicolor A3(2) and Pseudomonas aeruginosa PA14. We compared our results with those obtained by three currently available promoter predicting algorithms: the ?70consensus-based PePPER, the ? factors consensus-based bTSSfinder, and PromPredict which is based on double-helix DNA stability. Our results demonstrated that G4PromFinder is more suitable than the three reference tools for both the genomes. In fact our algorithm achieved the higher accuracy (F1-scores 0.61 and 0.53 in the two genomes) as compared to the next best tool that is PromPredict (F1-scores 0.46 and 0.48). Consensus-based algorithms produced lower performances with the analyzed GC-rich genomes. CONCLUSIONS:Our analysis shows that G4PromFinder is a powerful tool for promoter search in GC-rich bacteria, especially for bacteria coding for a lot of ? factors, such as the model microorganism S. coelicolor A3(2). Moreover consensus-based tools and, in general, tools that are based on specific features of bacterial ? factors seem to be less performing for promoter prediction in these types of bacterial genomes.

SUBMITTER: Di Salvo M 

PROVIDER: S-EPMC5801747 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

G4PromFinder: an algorithm for predicting transcription promoters in GC-rich bacterial genomes based on AT-rich elements and G-quadruplex motifs.

Di Salvo Marco M   Pinatel Eva E   Talà Adelfia A   Fondi Marco M   Peano Clelia C   Alifano Pietro P  

BMC bioinformatics 20180206 1


<h4>Background</h4>Over the last few decades, computational genomics has tremendously contributed to decipher biology from genome sequences and related data. Considerable effort has been devoted to the prediction of transcription promoter and terminator sites that represent the essential "punctuation marks" for DNA transcription. Computational prediction of promoters in prokaryotes is a problem whose solution is far from being determined in computational genomics. The majority of published bacte  ...[more]

Similar Datasets

| S-EPMC2412878 | biostudies-literature
| S-EPMC4117428 | biostudies-literature
| S-EPMC2034261 | biostudies-literature
| S-EPMC3000368 | biostudies-literature
| S-EPMC7980453 | biostudies-literature
| S-EPMC7102960 | biostudies-literature
| S-EPMC8153448 | biostudies-literature
| S-EPMC275461 | biostudies-literature
| S-EPMC6598670 | biostudies-literature
| S-EPMC3724673 | biostudies-literature