Unknown

Dataset Information

0

PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species.


ABSTRACT: Pan-genome ortholog clustering tool (PanOCT) is a tool for pan-genomic analysis of closely related prokaryotic species or strains. PanOCT uses conserved gene neighborhood information to separate recently diverged paralogs into orthologous clusters where homology-only clustering methods cannot. The results from PanOCT and three commonly used graph-based ortholog-finding programs were compared using a set of four publicly available strains of the same bacterial species. All four methods agreed on ?70% of the clusters and ?86% of the proteins. The clusters that did not agree were inspected for evidence of correctness resulting in 85 high-confidence manually curated clusters that were used to compare all four methods.

SUBMITTER: Fouts DE 

PROVIDER: S-EPMC3526259 | biostudies-literature | 2012 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species.

Fouts Derrick E DE   Brinkac Lauren L   Beck Erin E   Inman Jason J   Sutton Granger G  

Nucleic acids research 20120816 22


Pan-genome ortholog clustering tool (PanOCT) is a tool for pan-genomic analysis of closely related prokaryotic species or strains. PanOCT uses conserved gene neighborhood information to separate recently diverged paralogs into orthologous clusters where homology-only clustering methods cannot. The results from PanOCT and three commonly used graph-based ortholog-finding programs were compared using a set of four publicly available strains of the same bacterial species. All four methods agreed on  ...[more]

Similar Datasets

| S-EPMC2888116 | biostudies-literature
| S-EPMC6327494 | biostudies-literature
| S-EPMC3169350 | biostudies-literature
| S-EPMC3396514 | biostudies-literature
| S-EPMC7716509 | biostudies-literature
| S-EPMC4644643 | biostudies-literature
| S-EPMC7216594 | biostudies-literature
| S-EPMC5389983 | biostudies-literature
| S-EPMC2681618 | biostudies-literature
| S-EPMC7261169 | biostudies-literature