Unknown

Dataset Information

0

PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees.


ABSTRACT:

Background

Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure files to connect multiple software. When the species number is large, the intermediate steps become a bottleneck that may seriously affect the efficiency of tree building.

Results

Here, we present an easy-to-use pipeline named PhySpeTree to facilitate the reconstruction of species trees across bacterial, archaeal, and eukaryotic organisms. Users need only to input the abbreviations of species names; PhySpeTree prepares complex configure files for different software, then automatically downloads genomic data, cleans sequences, and builds trees. PhySpeTree allows users to perform critical steps such as sequence alignment and tree construction by adjusting advanced options. PhySpeTree provides two parallel pipelines based on concatenated highly conserved proteins and small subunit ribosomal RNA sequences, respectively. Accessory modules, such as those for inserting new species, generating visualization configurations, and combining trees, are distributed along with PhySpeTree.

Conclusions

Together with accessory modules, PhySpeTree significantly simplifies tree reconstruction. PhySpeTree is implemented in Python running on modern operating systems (Linux, macOS, and Windows). The source code is freely available with detailed documentation (https://github.com/yangfangs/physpetools).

SUBMITTER: Fang Y 

PROVIDER: S-EPMC6889546 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees.

Fang Yang Y   Liu Chengcheng C   Lin Jiangyi J   Li Xufeng X   Alavian Kambiz N KN   Yang Yi Y   Niu Yulong Y  

BMC evolutionary biology 20191202 1


<h4>Background</h4>Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure files to connect multiple software. When the species number is large, the intermediate steps become a bottleneck that may seriously affect the efficiency of tree building.<h4>Results</h4>Here, w  ...[more]

Similar Datasets

| S-EPMC9194518 | biostudies-literature
| S-EPMC3194234 | biostudies-literature
| S-EPMC4051759 | biostudies-other
| S-EPMC2228325 | biostudies-literature
| S-EPMC2432038 | biostudies-literature
2015-06-26 | E-GEOD-69184 | biostudies-arrayexpress
| S-EPMC2845580 | biostudies-literature
| S-EPMC3813836 | biostudies-other
| S-EPMC2720955 | biostudies-literature
| S-EPMC4044049 | biostudies-literature