Ontology highlight
ABSTRACT: Motivation
Existing coalescent models and phylogenetic tools based on them are not designed for studying the genealogy of sequences like those of HIV, since in HIV recombinants with multiple cross-over points between the parental strains frequently arise. Hence, ambiguous cases in the classification of HIV sequences into subtypes and circulating recombinant forms (CRFs) have been treated with ad hoc methods in lack of tools based on a comprehensive coalescent model accounting for complex recombination patterns.Results
We developed the program ARGUS that scores classifications of sequences into subtypes and recombinant forms. It reconstructs ancestral recombination graphs (ARGs) that reflect the genealogy of the input sequences given a classification hypothesis. An ARG with maximal probability is approximated using a Markov chain Monte Carlo approach. ARGUS was able to distinguish the correct classification with a low error rate from plausible alternative classifications in simulation studies with realistic parameters. We applied our algorithm to decide between two recently debated alternatives in the classification of CRF02 of HIV-1 and find that CRF02 is indeed a recombinant of Subtypes A and G.Availability
ARGUS is implemented in C++ and the source code is available at http://gobics.de/software.
SUBMITTER: Bulla I
PROVIDER: S-EPMC2913666 | biostudies-literature | 2010 Jun
REPOSITORIES: biostudies-literature
Bulla Ingo I Schultz Anne-Kathrin AK Schreiber Fabian F Zhang Ming M Leitner Thomas T Korber Bette B Morgenstern Burkhard B Stanke Mario M
Bioinformatics (Oxford, England) 20100416 11
<h4>Motivation</h4>Existing coalescent models and phylogenetic tools based on them are not designed for studying the genealogy of sequences like those of HIV, since in HIV recombinants with multiple cross-over points between the parental strains frequently arise. Hence, ambiguous cases in the classification of HIV sequences into subtypes and circulating recombinant forms (CRFs) have been treated with ad hoc methods in lack of tools based on a comprehensive coalescent model accounting for complex ...[more]