Unknown

Dataset Information

0

Cognac: rapid generation of concatenated gene alignments for phylogenetic inference from large, bacterial whole genome sequencing datasets.


ABSTRACT:

Background

The quantity of genomic data is expanding at an increasing rate. Tools for phylogenetic analysis which scale to the quantity of available data are required. To address this need, we present cognac, a user-friendly software package to rapidly generate concatenated gene alignments for phylogenetic analysis.

Results

We illustrate that cognac is able to rapidly identify phylogenetic marker genes using a data driven approach and efficiently generate concatenated gene alignments for very large genomic datasets. To benchmark our tool, we generated core gene alignments for eight unique genera of bacteria, including a dataset of over 11,000 genomes from the genus Escherichia producing an alignment with 1353 genes, which was constructed in less than 17 h.

Conclusions

We demonstrate that cognac presents an efficient method for generating concatenated gene alignments for phylogenetic analysis. We have released cognac as an R package ( https://github.com/rdcrawford/cognac ) with customizable parameters for adaptation to diverse applications.

SUBMITTER: Crawford RD 

PROVIDER: S-EPMC7885345 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

cognac: rapid generation of concatenated gene alignments for phylogenetic inference from large, bacterial whole genome sequencing datasets.

Crawford Ryan D RD   Snitkin Evan S ES  

BMC bioinformatics 20210215 1


<h4>Background</h4>The quantity of genomic data is expanding at an increasing rate. Tools for phylogenetic analysis which scale to the quantity of available data are required. To address this need, we present cognac, a user-friendly software package to rapidly generate concatenated gene alignments for phylogenetic analysis.<h4>Results</h4>We illustrate that cognac is able to rapidly identify phylogenetic marker genes using a data driven approach and efficiently generate concatenated gene alignme  ...[more]

Similar Datasets

| S-EPMC4330336 | biostudies-literature
| S-EPMC2647833 | biostudies-literature
| S-EPMC8317108 | biostudies-literature
| S-EPMC4251999 | biostudies-literature
| S-EPMC3416384 | biostudies-literature
| S-EPMC4538881 | biostudies-literature
| S-EPMC9113349 | biostudies-literature
| S-EPMC6726478 | biostudies-literature
| S-EPMC6294524 | biostudies-literature
| S-EPMC2760884 | biostudies-other