Unknown

Dataset Information

0

SeMPI: a genome-based secondary metabolite prediction and identification web server.


ABSTRACT: The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially due to as yet unpredictable post-modifications. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by polyketide synthases of type I modular. In order to limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi.

SUBMITTER: Zierep PF 

PROVIDER: S-EPMC5570227 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

SeMPI: a genome-based secondary metabolite prediction and identification web server.

Zierep Paul F PF   Padilla Natàlia N   Yonchev Dimitar G DG   Telukunta Kiran K KK   Klementz Dennis D   Günther Stefan S  

Nucleic acids research 20170701 W1


The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, s  ...[more]

Similar Datasets

| S-EPMC2703931 | biostudies-literature
| S-EPMC441583 | biostudies-literature
| S-EPMC4987901 | biostudies-other
| S-EPMC4647888 | biostudies-other
| S-EPMC3818432 | biostudies-literature
| S-EPMC5870701 | biostudies-literature
| S-EPMC2703887 | biostudies-other
| S-EPMC1538802 | biostudies-other
| S-EPMC7319462 | biostudies-literature
| S-EPMC7968678 | biostudies-literature