Unknown

Dataset Information

0

Nested Machine Learning Facilitates Increased Sequence Content for Large-Scale Automated High Resolution Melt Genotyping.


ABSTRACT: High Resolution Melt (HRM) is a versatile and rapid post-PCR DNA analysis technique primarily used to differentiate sequence variants among only a few short amplicons. We recently developed a one-vs-one support vector machine algorithm (OVO SVM) that enables the use of HRM for identifying numerous short amplicon sequences automatically and reliably. Herein, we set out to maximize the discriminating power of HRM?+?SVM for a single genetic locus by testing longer amplicons harboring significantly more sequence information. Using universal primers that amplify the hypervariable bacterial 16?S rRNA gene as a model system, we found that long amplicons yield more complex HRM curve shapes. We developed a novel nested OVO SVM approach to take advantage of this feature and achieved 100% accuracy in the identification of 37 clinically relevant bacteria in Leave-One-Out-Cross-Validation. A subset of organisms were independently tested. Those from pure culture were identified with high accuracy, while those tested directly from clinical blood bottles displayed more technical variability and reduced accuracy. Our findings demonstrate that long sequences can be accurately and automatically profiled by HRM with a novel nested SVM approach and suggest that clinical sample testing is feasible with further optimization.

SUBMITTER: Fraley SI 

PROVIDER: S-EPMC4726007 | biostudies-other | 2016 Jan

REPOSITORIES: biostudies-other

altmetric image

Publications

Nested Machine Learning Facilitates Increased Sequence Content for Large-Scale Automated High Resolution Melt Genotyping.

Fraley Stephanie I SI   Athamanolap Pornpat P   Masek Billie J BJ   Hardick Justin J   Carroll Karen C KC   Hsieh Yu-Hsiang YH   Rothman Richard E RE   Gaydos Charlotte A CA   Wang Tza-Huei TH   Yang Samuel S  

Scientific reports 20160118


High Resolution Melt (HRM) is a versatile and rapid post-PCR DNA analysis technique primarily used to differentiate sequence variants among only a few short amplicons. We recently developed a one-vs-one support vector machine algorithm (OVO SVM) that enables the use of HRM for identifying numerous short amplicon sequences automatically and reliably. Herein, we set out to maximize the discriminating power of HRM + SVM for a single genetic locus by testing longer amplicons harboring significantly  ...[more]

Similar Datasets

| S-EPMC6286316 | biostudies-literature
| S-EPMC2620869 | biostudies-literature
2022-07-27 | GSE209804 | GEO
| S-EPMC4659556 | biostudies-literature
2019-11-07 | GSE124203 | GEO
| S-EPMC3911307 | biostudies-literature
2022-08-14 | GSE184943 | GEO