Unknown

Dataset Information

0

Taxonomic weighting improves the accuracy of a gap-filling algorithm for metabolic models.


ABSTRACT: MOTIVATION:The increasing availability of annotated genome sequences enables construction of genome-scale metabolic networks, which are useful tools for studying organisms of interest. However, due to incomplete genome annotations, draft metabolic models contain gaps that must be filled in a time-consuming process before they are usable. Optimization-based algorithms that fill these gaps have been developed, however, gap-filling algorithms show significant error rates and often introduce incorrect reactions. RESULTS:Here, we present a new gap-filling method that computes the costs of candidate gap-filling reactions from a universal reaction database (MetaCyc) based on taxonomic information. When gap-filling a metabolic model for an organism M (such as Escherichia coli), the cost for reaction R is based on the frequency with which R occurs in other organisms within the phylum of M (in this case, Proteobacteria). The assumption behind this method is that different taxonomic groups are biased toward using different metabolic reactions. Evaluation of the new gap-filler on randomly degraded variants of the EcoCyc metabolic model for E.coli showed an increase in the average F1-score to 99.0 (when using the variable weights by frequency method at the phylum level), compared to 91.0 using the previous MetaFlux gap-filler and 80.3 using a basic gap-filler. Evaluation on two other microbial metabolic models showed similar improvements. AVAILABILITY AND IMPLEMENTATION:The Pathway Tools software (including MetaFlux) is free for academic use and is available at http://pathwaytools.com. Additional code for reproducing the results presented here is available at www.ai.sri.com/pkarp/pubs/taxgap/supplementary.zip. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Ong WK 

PROVIDER: S-EPMC7523652 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Taxonomic weighting improves the accuracy of a gap-filling algorithm for metabolic models.

Ong Wai Kit WK   Midford Peter E PE   Karp Peter D PD  

Bioinformatics (Oxford, England) 20200301 6


<h4>Motivation</h4>The increasing availability of annotated genome sequences enables construction of genome-scale metabolic networks, which are useful tools for studying organisms of interest. However, due to incomplete genome annotations, draft metabolic models contain gaps that must be filled in a time-consuming process before they are usable. Optimization-based algorithms that fill these gaps have been developed, however, gap-filling algorithms show significant error rates and often introduce  ...[more]

Similar Datasets

| S-EPMC6006690 | biostudies-literature
| S-EPMC8584699 | biostudies-literature
| S-EPMC4147887 | biostudies-other
| S-EPMC4199484 | biostudies-literature
| S-EPMC7427806 | biostudies-literature
| S-EPMC4392588 | biostudies-literature
| S-EPMC4203746 | biostudies-literature
| S-EPMC5302834 | biostudies-literature
| S-EPMC6149537 | biostudies-literature