Unknown

Dataset Information

0

SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.


ABSTRACT:

Background

Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data.

Results

We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline.

Conclusions

Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.

SUBMITTER: Lee TH 

PROVIDER: S-EPMC3945939 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2432038 | biostudies-literature
| S-EPMC3267785 | biostudies-literature
| S-EPMC1626092 | biostudies-literature
| S-EPMC9116704 | biostudies-literature
| S-EPMC3583733 | biostudies-literature
| S-EPMC6543773 | biostudies-literature
| S-EPMC8256826 | biostudies-literature
| S-EPMC3521233 | biostudies-literature
| S-EPMC8763121 | biostudies-literature
| S-EPMC5146993 | biostudies-literature