Dataset Information

TREE2FASTA: a flexible Perl script for batch extraction of FASTA sequences from exploratory phylogenetic trees.


ABSTRACT: The body of DNA sequence data lacking taxonomically informative sequence headers is rapidly growing in user and public databases (e.g. sequences lacking identification and contaminants). In the context of systematics studies, sorting such sequence data for taxonomic curation and/or molecular diversity characterization (e.g. crypticism) often requires the building of exploratory phylogenetic trees with reference taxa. The subsequent step of segregating DNA sequences of interest based on observed topological relationships can be challenging, especially for large datasets. We have written TREE2FASTA, a Perl script that enables and expedites the sorting of FASTA-formatted sequence data from exploratory phylogenetic trees. TREE2FASTA takes advantage of the interactive, rapid point-and-click color selection and/or annotation of tree leaves in the popular Java tree viewer FigTree to segregate groups of FASTA sequences of interest into separate files. TREE2FASTA allows for both simple and nested segregation designs to facilitate the simultaneous preparation of multiple data sets that may overlap in sequence content.
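
The color-based segregation described in the abstract can be illustrated with a minimal Python sketch. This is not the TREE2FASTA implementation (which is a Perl script), and it rests on stated assumptions: that colored leaves appear in the FigTree-saved NEXUS file as lines such as `seq1[&!color=#ff0000]`, that leaf names contain no spaces, and that leaf names match the first word of the FASTA headers. The function names `leaves_by_color`, `read_fasta`, and `split_fasta_by_color` are hypothetical.

```python
import re
from collections import defaultdict

# Assumed form of a colored leaf line in a FigTree-saved NEXUS file:
#   seq1[&!color=#ff0000]
# Quoted names containing spaces are not handled in this sketch.
LEAF_RE = re.compile(
    r"^\s*'?([^'\[\s]+)'?\[&[^\]]*!color=(#[-0-9A-Fa-f]+)[^\]]*\]"
)


def leaves_by_color(nexus_text):
    """Map each color code to the list of leaf names carrying that color."""
    groups = defaultdict(list)
    for line in nexus_text.splitlines():
        match = LEAF_RE.match(line)
        if match:
            groups[match.group(2)].append(match.group(1))
    return dict(groups)


def read_fasta(fasta_text):
    """Parse FASTA text into {name: sequence}; name is the first header word."""
    parts = {}
    name = None
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            parts[name] = []
        elif name is not None:
            parts[name].append(line)
    return {n: "".join(chunks) for n, chunks in parts.items()}


def split_fasta_by_color(nexus_text, fasta_text):
    """Return {color: FASTA text} with one group of sequences per leaf color."""
    seqs = read_fasta(fasta_text)
    output = {}
    for color, names in leaves_by_color(nexus_text).items():
        # Leaves without a matching FASTA record are skipped silently.
        output[color] = "".join(
            f">{n}\n{seqs[n]}\n" for n in names if n in seqs
        )
    return output
```

Each value of the returned dictionary could then be written to its own file, for example one file per color code. Uncolored leaves are left out of every group, and nested or annotation-based designs are outside the scope of this sketch.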

SUBMITTER: Sauvage T 

PROVIDER: S-EPMC5838971 | biostudies-literature | 2018 Mar

REPOSITORIES: biostudies-literature

Publications

TREE2FASTA: a flexible Perl script for batch extraction of FASTA sequences from exploratory phylogenetic trees.

Thomas Sauvage, Sophie Plouviez, William E. Schmidt, Suzanne Fredericq

BMC Research Notes, 5 March 2018, issue 1

