Unknown

Dataset Information

0

Phy-Mer: a novel alignment-free and reference-independent mitochondrial haplogroup classifier.


ABSTRACT: MOTIVATION:All current mitochondrial haplogroup classification tools require variants to be detected from an alignment with the reference sequence and to be properly named according to the canonical nomenclature standards for describing mitochondrial variants, before they can be compared with the haplogroup determining polymorphisms. With the emergence of high-throughput sequencing technologies and hence greater availability of mitochondrial genome sequences, there is a strong need for an automated haplogroup classification tool that is alignment-free and agnostic to reference sequence. RESULTS:We have developed a novel mitochondrial genome haplogroup-defining algorithm using a k-mer approach namely Phy-Mer. Phy-Mer performs equally well as the leading haplogroup classifier, HaploGrep, while avoiding the errors that may occur when preparing variants to required formats and notations. We have further expanded Phy-Mer functionality such that next-generation sequencing data can be used directly as input. AVAILABILITY AND IMPLEMENTATION:Phy-Mer is publicly available under the GNU Affero General Public License v3.0 on GitHub (https://github.com/danielnavarrogomez/phy-mer). CONTACT:Xiaowu_Gai@meei.harvard.edu SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Navarro-Gomez D 

PROVIDER: S-EPMC4393525 | biostudies-literature | 2015 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Phy-Mer: a novel alignment-free and reference-independent mitochondrial haplogroup classifier.

Navarro-Gomez Daniel D   Leipzig Jeremy J   Shen Lishuang L   Lott Marie M   Stassen Alphons P M AP   Wallace Douglas C DC   Wiggs Janey L JL   Falk Marni J MJ   van Oven Mannis M   Gai Xiaowu X  

Bioinformatics (Oxford, England) 20141212 8


<h4>Motivation</h4>All current mitochondrial haplogroup classification tools require variants to be detected from an alignment with the reference sequence and to be properly named according to the canonical nomenclature standards for describing mitochondrial variants, before they can be compared with the haplogroup determining polymorphisms. With the emergence of high-throughput sequencing technologies and hence greater availability of mitochondrial genome sequences, there is a strong need for a  ...[more]

Similar Datasets

| S-EPMC10603592 | biostudies-literature
| S-EPMC7538862 | biostudies-literature
| S-EPMC3341554 | biostudies-literature
| S-EPMC4832552 | biostudies-literature
| S-EPMC6022720 | biostudies-literature
| S-EPMC4792907 | biostudies-literature
| S-EPMC5244389 | biostudies-literature
| S-EPMC6314158 | biostudies-literature
| S-EPMC3046486 | biostudies-literature
| S-EPMC8409003 | biostudies-literature