
Dataset Information


PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes.


ABSTRACT:

Background: Plastome (plastid genome) sequences provide valuable information for understanding the phylogenetic relationships and evolutionary history of plants. Although the rapid development of high-throughput sequencing technology has led to an explosion of plastome sequences, annotation remains a significant bottleneck. User-friendly batch annotation of multiple plastomes is urgently needed.

Results: We introduce Plastid Genome Annotator (PGA), a standalone command-line tool that performs rapid, accurate, and flexible batch annotation of newly generated target plastomes based on well-annotated reference plastomes. In contrast to existing tools, PGA uses reference plastomes as the query and unannotated target plastomes as the subject to locate genes, which we refer to as the reverse query-subject BLAST search approach. PGA accurately identifies gene and intron boundaries as well as intron loss. The program outputs GenBank-formatted files as well as a log file to assist users in verifying annotations. Comparisons against other available plastome annotation tools demonstrated the high annotation accuracy of PGA, with little or no post-annotation verification necessary. Likewise, we demonstrated the flexibility of reference plastomes within PGA by annotating the plastome of Rosa roxburghii using that of Amborella trichopoda as a reference. The program, user manual, and example data sets are freely available at https://github.com/quxiaojian/PGA.

Conclusions: PGA facilitates rapid, accurate, and flexible batch annotation of plastomes across plants. For projects in which multiple plastomes are generated, the time savings for high-quality plastome annotation are especially significant.
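The direction of the reverse query-subject search described in the abstract can be illustrated with a toy sketch. This is not PGA's implementation: exact substring matching stands in for BLAST, and the function names, gene names, and sequences below are all hypothetical. It only shows the idea of using reference genes as the query and the unannotated target plastome as the subject.

```python
# Toy illustration only. Exact substring matching stands in for BLAST,
# and every name and sequence here is a hypothetical example.

def revcomp(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def locate_genes(reference_genes, target):
    """Search the target (subject) for each reference gene (query).

    Returns a dict mapping gene name to (start, end, strand),
    with 1-based inclusive coordinates on the target sequence.
    """
    hits = {}
    for name, gene in reference_genes.items():
        pos = target.find(gene)  # forward strand
        if pos != -1:
            hits[name] = (pos + 1, pos + len(gene), "+")
            continue
        pos = target.find(revcomp(gene))  # reverse strand
        if pos != -1:
            hits[name] = (pos + 1, pos + len(gene), "-")
    return hits

# Hypothetical reference genes and a made-up target sequence.
reference_genes = {"geneA": "ATGGCTTAA", "geneB": "ATGCCCTGA"}
target = "GGG" + "ATGGCTTAA" + "CC" + revcomp("ATGCCCTGA") + "TT"

hits = locate_genes(reference_genes, target)
for name, (start, end, strand) in hits.items():
    print(name, start, end, strand)
```

A real search would tolerate sequence divergence between reference and target, which is why an alignment tool such as BLAST is used in practice instead of exact matching.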

SUBMITTER: Qu XJ 

PROVIDER: S-EPMC6528300 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature


Publications

PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes.

Qu Xiao-Jian, Moore Michael J, Li De-Zhu, Yi Ting-Shuang

Plant Methods, 2019-05-21



Similar Datasets

| S-EPMC3735449 | biostudies-literature
| S-EPMC2770656 | biostudies-literature
| S-EPMC1919397 | biostudies-literature
| S-EPMC1318494 | biostudies-literature
| S-EPMC8098023 | biostudies-literature
| S-EPMC5884839 | biostudies-other
| S-EPMC5362640 | biostudies-literature
| S-EPMC9749704 | biostudies-literature
| S-EPMC3875567 | biostudies-literature
2020-12-05 | GSE162690 | GEO