Unknown

Dataset Information

0

TreeSAPP: the Tree-based Sensitive and Accurate Phylogenetic Profiler.


ABSTRACT: MOTIVATION:Microbial communities drive matter and energy transformations integral to global biogeochemical cycles, yet many taxonomic groups facilitating these processes remain poorly represented in biological sequence databases. Due to this missing information, taxonomic assignment of sequences from environmental genomes remains inaccurate. RESULTS:We present the Tree-based Sensitive and Accurate Phylogenetic Profiler (TreeSAPP) software for functionally and taxonomically classifying genes, reactions and pathways from genomes of cultivated and uncultivated microorganisms using reference packages representing coding sequences mediating multiple globally relevant biogeochemical cycles. TreeSAPP uses linear regression of evolutionary distance on taxonomic rank to improve classifications, assigning both closely related and divergent query sequences at the appropriate taxonomic rank. TreeSAPP is able to provide quantitative functional and taxonomic classifications for both assembled and unassembled sequences and files supporting interactive tree of life visualizations. AVAILABILITY AND IMPLEMENTATION:TreeSAPP was developed in Python 3 as an open-source Python package and is available on GitHub at https://github.com/hallamlab/TreeSAPP. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Morgan-Lang C 

PROVIDER: S-EPMC7695126 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

TreeSAPP: the Tree-based Sensitive and Accurate Phylogenetic Profiler.

Morgan-Lang Connor C   McLaughlin Ryan R   Armstrong Zachary Z   Zhang Grace G   Chan Kevin K   Hallam Steven J SJ  

Bioinformatics (Oxford, England) 20200901 18


<h4>Motivation</h4>Microbial communities drive matter and energy transformations integral to global biogeochemical cycles, yet many taxonomic groups facilitating these processes remain poorly represented in biological sequence databases. Due to this missing information, taxonomic assignment of sequences from environmental genomes remains inaccurate.<h4>Results</h4>We present the Tree-based Sensitive and Accurate Phylogenetic Profiler (TreeSAPP) software for functionally and taxonomically classif  ...[more]

Similar Datasets

| S-EPMC4130513 | biostudies-literature
2015-01-08 | GSE54339 | GEO
2015-01-08 | E-GEOD-54339 | biostudies-arrayexpress
| S-EPMC38248 | biostudies-other
| S-EPMC3466146 | biostudies-literature
| S-EPMC9236582 | biostudies-literature
| S-EPMC5123309 | biostudies-literature
| S-EPMC3669295 | biostudies-literature
| S-EPMC4082356 | biostudies-literature
| S-EPMC4143915 | biostudies-literature