Dataset Information

WhatsGNU: a tool for identifying proteomic novelty.


ABSTRACT: To understand diversity in enormous collections of genome sequences, we need computationally scalable tools that can quickly contextualize individual genomes based on their similarities and identify the features that make each genome unique. We present WhatsGNU, a tool based on exact-match proteomic compression that, in seconds, classifies any new genome and provides a detailed report of protein alleles that may have novel functional differences. We use this technique to characterize the total allelic diversity (panallelome) of Salmonella enterica, Mycobacterium tuberculosis, Pseudomonas aeruginosa, and Staphylococcus aureus. The approach could be extended to other species. WhatsGNU is available from https://github.com/ahmedmagds/WhatsGNU.
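
The idea of exact-match proteomic compression can be illustrated with a minimal Python sketch. This is not WhatsGNU's own code or API: the function names `build_database` and `novelty_report`, the input layout (a dict of genome name to protein sequences), and the toy sequences are all assumptions made for illustration. The sketch collapses many proteomes into one count per distinct protein sequence, then looks up each protein of a new genome by exact match, so that alleles with few or no matches stand out as candidates for novelty.

```python
from collections import Counter


def build_database(genomes):
    """Compress proteomes into one occurrence count per distinct sequence.

    `genomes` is a hypothetical layout: {genome_name: [protein_sequence, ...]}.
    """
    counts = Counter()
    for proteins in genomes.values():
        counts.update(proteins)
    return counts


def novelty_report(database, query_proteins):
    """Return (protein_name, exact_match_count) pairs, rarest first.

    `query_proteins` is a hypothetical layout: {protein_name: sequence}.
    A count of 0 means the exact allele was never seen in the database.
    """
    report = [(name, database[seq]) for name, seq in query_proteins.items()]
    return sorted(report, key=lambda item: item[1])


# Toy data, invented for illustration only.
genomes = {
    "genome_1": ["MKT", "MAL"],
    "genome_2": ["MKT", "MAV"],
    "genome_3": ["MKT", "MAL"],
}
database = build_database(genomes)

query = {"prot_a": "MKT", "prot_b": "MAL", "prot_c": "MQQ"}
report = novelty_report(database, query)
for name, count in report:
    print(name, count)
```

Because lookups are exact dictionary hits rather than alignments, each query protein costs roughly constant time, which is one plausible reason an approach of this kind can report on a new genome in seconds.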

SUBMITTER: Moustafa AM 

PROVIDER: S-EPMC7059281 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

Publications

WhatsGNU: a tool for identifying proteomic novelty.

Moustafa AM, Planet PJ

Genome Biology, 2020-03-05, 1


Similar Datasets

| S-EPMC4423384 | biostudies-literature
2011-09-01 | E-GEOD-29098 | biostudies-arrayexpress
2011-09-01 | GSE29098 | GEO
| S-EPMC5712127 | biostudies-literature
| S-EPMC4941358 | biostudies-literature
| S-EPMC8302666 | biostudies-literature
| S-EPMC6795751 | biostudies-literature
| S-EPMC7057242 | biostudies-literature
| S-EPMC4803341 | biostudies-literature
| S-EPMC10654103 | biostudies-literature