A systematic comparison of chloroplast genome assembly tools.


ABSTRACT:

Background

Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into a eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essential genes and are regularly used in biotechnology and phylogenetics. Several assemblers that are able to assess the plastid genome have been developed. These assemblers often use data from whole-genome sequencing experiments, which usually contain reads covering the complete chloroplast genome.

Results

The performance of different assembly tools has never been systematically compared. Here, we present a benchmark of seven chloroplast assembly tools, capable of succeeding in more than 60% of known real data sets. Our results show significant differences between the tested assemblers, both in their ability to generate whole chloroplast genome sequences and in their computational requirements. The examination of 105 data sets from species with unknown plastid genomes leads to the assembly of 20 novel chloroplast genomes.

Conclusions

We create Docker images for each tested tool; they are freely available to the scientific community and ensure the reproducibility of the analyses. These containers allow data sets to be analyzed and screened for chloroplast genomes on standard computational infrastructure. Thus, large-scale screening for chloroplasts within genomic sequencing data is feasible.
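To illustrate what screening several data sets with one containerized tool could look like, here is a minimal Python sketch that only builds the `docker run` command lines. The image name, the directory layout, and the tool arguments are hypothetical placeholders, not the names published by the authors; the `/data` mount point is likewise an assumption.

```python
from pathlib import Path


def build_docker_command(image, data_dir, tool_args):
    """Build an argv list for `docker run` with data_dir mounted at /data."""
    host_dir = Path(data_dir).resolve()
    return [
        "docker", "run", "--rm",        # remove the container when it exits
        "-v", f"{host_dir}:/data",      # bind-mount the reads into the container
        "-w", "/data",                  # run the tool from the mounted directory
        image,
        *tool_args,
    ]


def screening_commands(image, dataset_dirs, tool_args):
    """Return one command per data set directory, keyed by directory name."""
    return {
        Path(d).name: build_docker_command(image, d, tool_args)
        for d in dataset_dirs
    }


if __name__ == "__main__":
    # Hypothetical image name and arguments, for illustration only.
    commands = screening_commands(
        "example/chloroplast-assembler:latest",
        ["datasets/sample_A", "datasets/sample_B"],
        ["--help"],
    )
    for name, argv in commands.items():
        print(name, " ".join(argv))
```

Each returned list can be passed to `subprocess.run(argv, check=True)` on a machine with Docker installed; the actual image names and command-line options should be taken from the documentation of the respective tool.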

SUBMITTER: Freudenthal JA 

PROVIDER: S-EPMC7520963 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

Publications

Freudenthal JA, Pfaff S, Terhoeven N, Korte A, Ankenbrand MJ, Förster F. A systematic comparison of chloroplast genome assembly tools. Genome Biology, 2020-09-28, issue 1.


Similar Datasets

| S-EPMC4201551 | biostudies-literature
| S-EPMC6865063 | biostudies-literature
| S-EPMC3056720 | biostudies-literature
| S-EPMC333770 | biostudies-other
| S-EPMC5704237 | biostudies-literature
| PRJEB421 | ENA
| PRJEB420 | ENA
2023-03-01 | GSE195618 | GEO
| PRJNA507697 | ENA
| S-EPMC10548655 | biostudies-literature