
Dataset Information


Trainable high resolution melt curve machine learning classifier for large-scale reliable genotyping of sequence variants.


ABSTRACT: High resolution melt (HRM) is gaining considerable popularity as a simple and robust method for genotyping sequence variants. However, accurate genotyping of an unknown sample for which a large number of possible variants may exist will require an automated HRM curve identification method capable of comparing unknowns against a large cohort of known sequence variants. Herein, we describe a new method for automated HRM curve classification based on machine learning methods and learned tolerance for reaction condition deviations. We tested this method in silico through multiple cross-validations using curves generated from 9 different simulated experimental conditions to classify 92 known serotypes of Streptococcus pneumoniae and demonstrated over 99% accuracy with 8 training curves per serotype. In vitro verification of the algorithm was tested using sequence variants of a cancer-related gene and demonstrated 100% accuracy with 3 training curves per sequence variant. The machine learning algorithm enabled reliable, scalable, and automated HRM genotyping analysis with broad potential clinical and epidemiological applications.
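The train-then-classify workflow the abstract describes can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the melt curves are synthetic sigmoids, the reaction-condition deviation is modelled as a small random shift in melting temperature, and a nearest-centroid rule stands in for the classifier, since this record does not state which machine learning algorithm the authors used. The names `melt_curve`, `make_dataset`, `train`, `classify`, and `holdout_accuracy`, and the variant labels, are hypothetical.

```python
import math
import random


def melt_curve(tm, temps, width=1.0, noise=0.0, rng=None):
    """Synthetic normalized fluorescence: a sigmoid falling from 1 to 0 around tm."""
    rng = rng or random
    curve = []
    for t in temps:
        value = 1.0 / (1.0 + math.exp((t - tm) / width))
        if noise:
            value += rng.gauss(0.0, noise)  # additive measurement noise
        curve.append(value)
    return curve


def make_dataset(variant_tms, n_per_variant, temps, rng):
    """Generate several noisy curves per variant (assumed deviation: Tm shift)."""
    data = {}
    for label, tm in variant_tms.items():
        data[label] = [
            melt_curve(tm + rng.gauss(0.0, 0.1), temps, noise=0.005, rng=rng)
            for _ in range(n_per_variant)
        ]
    return data


def train(labelled):
    """Nearest-centroid 'training': store the mean curve of each variant."""
    model = {}
    for label, curves in labelled.items():
        n = len(curves)
        model[label] = [sum(column) / n for column in zip(*curves)]
    return model


def distance(a, b):
    """Euclidean distance between two curves sampled at the same temperatures."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(model, curve):
    """Return the label whose mean curve is closest to the unknown curve."""
    return min(model, key=lambda label: distance(model[label], curve))


def holdout_accuracy(data, n_train):
    """Train on the first n_train curves per variant, test on the rest."""
    model = train({label: curves[:n_train] for label, curves in data.items()})
    tests = [(label, c) for label, curves in data.items() for c in curves[n_train:]]
    hits = sum(classify(model, c) == label for label, c in tests)
    return hits / len(tests)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
temps = [70.0 + 0.1 * i for i in range(201)]  # 70 to 90 degrees C in 0.1 steps
variant_tms = {"variant_A": 78.0, "variant_B": 80.0, "variant_C": 82.0}
data = make_dataset(variant_tms, n_per_variant=12, temps=temps, rng=rng)
accuracy = holdout_accuracy(data, n_train=8)
print(f"held-out accuracy: {accuracy:.2f}")
```

The three synthetic variants are deliberately well separated, so this toy says nothing about the accuracy figures reported in the abstract. It only shows the shape of the procedure: a fixed number of training curves per variant, a learned reference per class, and a held-out evaluation.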

SUBMITTER: Athamanolap P 

PROVIDER: S-EPMC4183555 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature


Publications

Trainable high resolution melt curve machine learning classifier for large-scale reliable genotyping of sequence variants.

Pornpat Athamanolap, Vishwa Parekh, Stephanie I Fraley, Vatsal Agarwal, Dong J Shin, Michael A Jacobs, Tza-Huei Wang, Samuel Yang

PLoS One, 2014-10-02, volume 9



Similar Datasets

| S-EPMC4726007 | biostudies-literature
| S-EPMC4597282 | biostudies-literature
| S-EPMC8108535 | biostudies-literature
| S-EPMC10477177 | biostudies-literature
| S-EPMC5754072 | biostudies-literature
| S-EPMC3020821 | biostudies-literature
| S-EPMC9994400 | biostudies-literature
| S-EPMC2876125 | biostudies-literature
| S-EPMC2620869 | biostudies-literature
| S-EPMC5773611 | biostudies-literature