Unknown

Dataset Information

0

PanACoTA: a modular tool for massive microbial comparative genomics.


ABSTRACT: The study of the gene repertoires of microbial species, their pangenomes, has become a key part of microbial evolution and functional genomics. Yet, the increasing number of genomes available complicates the establishment of the basic building blocks of comparative genomics. Here, we present PanACoTA (https://github.com/gem-pasteur/PanACoTA), a tool that allows to download all genomes of a species, build a database with those passing quality and redundancy controls, uniformly annotate and then build their pangenome, several variants of core genomes, their alignments and a rapid but accurate phylogenetic tree. While many programs building pangenomes have become available in the last few years, we have focused on a modular method, that tackles all the key steps of the process, from download to phylogenetic inference. While all steps are integrated, they can also be run separately and multiple times to allow rapid and extensive exploration of the parameters of interest. PanACoTA is built in Python3, includes a singularity container and features to facilitate its future development. We believe PanACoTa is an interesting addition to the current set of comparative genomics tools, since it will accelerate and standardize the more routine parts of the work, allowing microbial genomicists to more quickly tackle their specific questions.

SUBMITTER: Perrin A 

PROVIDER: S-EPMC7803007 | biostudies-literature | 2021 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

PanACoTA: a modular tool for massive microbial comparative genomics.

Perrin Amandine A   Rocha Eduardo P C EPC  

NAR genomics and bioinformatics 20210112 1


The study of the gene repertoires of microbial species, their pangenomes, has become a key part of microbial evolution and functional genomics. Yet, the increasing number of genomes available complicates the establishment of the basic building blocks of comparative genomics. Here, we present PanACoTA (https://github.com/gem-pasteur/PanACoTA), a tool that allows to download all genomes of a species, build a database with those passing quality and redundancy controls, uniformly annotate and then b  ...[more]

Similar Datasets

| S-EPMC2967562 | biostudies-literature
| S-EPMC331400 | biostudies-literature
| S-EPMC2738131 | biostudies-literature
| S-EPMC3394297 | biostudies-literature
| S-EPMC3233612 | biostudies-literature
| S-EPMC10167986 | biostudies-literature
| S-EPMC4603748 | biostudies-literature
| S-EPMC112885 | biostudies-literature
| S-EPMC11371463 | biostudies-literature
| S-EPMC8715433 | biostudies-literature