Unknown

Dataset Information

0

Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach.


ABSTRACT: Copy number variation (CNV) is a primary source of structural variation in the human genome, leading to several disorders. Therefore, analyzing neonatal CNVs is crucial for managing CNV-related chromosomal disabilities. However, genomic waves can hinder accurate CNV analysis. To mitigate the influences of the waves, we adopted a machine learning approach and developed a new method that uses a modified log R ratio instead of the commonly used log R ratio. Validation results using samples with known CNVs demonstrated the superior performance of our method. We analyzed a total of 16,046 Korean newborn samples using the new method and identified CNVs related to 39 genetic disorders were identified in 342 cases. The most frequently detected CNV-related disorder was Joubert syndrome 4. The accuracy of our method was further confirmed by analyzing a subset of the detected results using NGS and comparing them with our results. The utilization of a genome-wide single nucleotide polymorphism array with wave offset was shown to be a powerful method for identifying CNVs in neonatal cases. The accurate screening and the ability to identify various disease susceptibilities offered by our new method could facilitate the identification of CNV-associated chromosomal disease etiologies.

SUBMITTER: Goh CJ 

PROVIDER: S-EPMC10871075 | biostudies-literature | 2023 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach.

Goh Chul Jun CJ   Kwon Hyuk-Jung HJ   Kim Yoonhee Y   Jung Seunghee S   Park Jiwoo J   Lee Isaac Kise IK   Park Bo-Ram BR   Kim Myeong-Ji MJ   Kim Min-Jeong MJ   Lee Min-Seob MS  

Diagnostics (Basel, Switzerland) 20231229 1


Copy number variation (CNV) is a primary source of structural variation in the human genome, leading to several disorders. Therefore, analyzing neonatal CNVs is crucial for managing CNV-related chromosomal disabilities. However, genomic waves can hinder accurate CNV analysis. To mitigate the influences of the waves, we adopted a machine learning approach and developed a new method that uses a modified log R ratio instead of the commonly used log R ratio. Validation results using samples with kno  ...[more]

Similar Datasets

| S-EPMC4184266 | biostudies-literature
| S-EPMC9623026 | biostudies-literature
| S-EPMC8809395 | biostudies-literature
| S-EPMC8434514 | biostudies-literature
| S-EPMC11899459 | biostudies-literature
| S-EPMC9936926 | biostudies-literature
| S-EPMC9071249 | biostudies-literature
| S-EPMC8128240 | biostudies-literature
| S-EPMC7591033 | biostudies-literature
2024-12-27 | GSE246294 | GEO