Dataset Information

SeMPI: a genome-based secondary metabolite prediction and identification web server.

ABSTRACT: The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not yet been identified. Even though genome mining tools have become significantly more efficient at identifying biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially because post-modifications are as yet unpredictable. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by modular type I polyketide synthases (PKS). To limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi.

SUBMITTER: Zierep PF

PROVIDER: S-EPMC5570227 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

Publications

SeMPI: a genome-based secondary metabolite prediction and identification web server.

Zierep PF, Padilla N, Yonchev DG, Telukunta KK, Klementz D, Günther S

Nucleic Acids Research, 2017-07-01, issue W1

PMID: 28453782

Similar Datasets

CENTROIDFOLD: a web server for RNA secondary structure prediction.
| S-EPMC2703931 | biostudies-literature

RDfolder: a web server for prediction of RNA secondary structure.
| S-EPMC441583 | biostudies-literature

cRNAsp12 Web Server for the Prediction of Circular RNA Secondary Structures and Stabilities
| S-EPMC9959564 | biostudies-literature

Viral IRES prediction system - a web server for prediction of the IRES secondary structure in silico.
| S-EPMC3818432 | biostudies-literature

MISA-web: a web server for microsatellite prediction.
| S-EPMC5870701 | biostudies-literature

KCD: A prediction web server of knowledge-based circular dichroism.
| S-EPMC10966356 | biostudies-literature

LINbase: a web server for genome-based identification of prokaryotes as members of crowdsourced taxa.
| S-EPMC7319462 | biostudies-literature

PhotoModPlus: A web server for photosynthetic protein prediction from genome neighborhood features.
| S-EPMC7968678 | biostudies-literature

systemsDock: a web server for network pharmacology-based prediction and analysis.
| S-EPMC4987901 | biostudies-literature

G4Hunter web application: a web server for G-quadruplex prediction.
| S-EPMC6748775 | biostudies-literature