
Dataset Information


PALADIN: protein alignment for functional profiling whole metagenome shotgun data.


ABSTRACT: Whole metagenome shotgun sequencing is a powerful approach for assaying the functional potential of microbial communities. We currently lack tools that efficiently and accurately align DNA reads against protein references, the technique necessary for constructing a functional profile. Here, we present PALADIN, a novel modification of the Burrows-Wheeler Aligner that provides accurate alignment, robust reporting capabilities and orders-of-magnitude improved efficiency by directly mapping in protein space. We compared the accuracy and efficiency of PALADIN against existing tools that employ nucleotide or protein alignment algorithms. Using simulated reads, PALADIN consistently outperformed the popular DNA read mappers BWA and NovoAlign in detected proteins, percentage of reads mapped and ontological similarity. We also compared PALADIN against four existing protein alignment tools (BLASTX, RAPSearch2, DIAMOND and Lambda) using empirically obtained reads. PALADIN yielded results seven times faster than the best-performing alternative, DIAMOND, and nearly 8000 times faster than BLASTX. PALADIN's accuracy was comparable to all tested solutions. PALADIN was implemented in C, and its source code and documentation are available at https://github.com/twestbrookunh/paladin. Contact: anthonyw@wildcats.unh.edu. Supplementary data are available at Bioinformatics online.

SUBMITTER: Westbrook A 

PROVIDER: S-EPMC5423455 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature


Publications

PALADIN: protein alignment for functional profiling whole metagenome shotgun data.

Anthony Westbrook, Jordan Ramsdell, Taruna Schuelke, Louisa Normington, R. Daniel Bergeron, W. Kelley Thomas, Matthew D. MacManes

Bioinformatics (Oxford, England), 2017-05-01, issue 10



Similar Datasets

| S-EPMC5070866 | biostudies-literature
| S-EPMC6693478 | biostudies-literature
| S-EPMC7931531 | biostudies-literature
| S-EPMC8377382 | biostudies-literature
| S-EPMC3464612 | biostudies-literature
| S-EPMC4687345 | biostudies-other
| S-EPMC6879543 | biostudies-literature
| S-EPMC5075713 | biostudies-literature
| S-EPMC4365909 | biostudies-literature
| S-EPMC5493203 | biostudies-literature