
Dataset Information


Prioritizing Clinically Relevant Copy Number Variation from Genetic Interactions and Gene Function Data.


ABSTRACT: It is becoming increasingly necessary to develop computerized methods for identifying the few disease-causing variants among the hundreds discovered in each individual patient. This problem is especially relevant for Copy Number Variants (CNVs), which can be interrogated with the low-cost hybridization arrays commonly used in clinical practice. We present a method to predict the disease relevance of CNVs that combines functional context and clinical phenotype to discover clinically harmful CNVs (and likely causative genes) in patients with a variety of phenotypes. We compare several feature and gene weighting systems for classifying both genes and CNVs. We applied the best-performing methodologies and parameters to over 2,500 Agilent CGH 180k Microarray CNVs derived from 140 patients. Our method achieved an F-score of 91.59%, with 87.08% precision and 97.00% recall. Our methods are freely available at https://github.com/compbio-UofT/cnv-prioritization. Our dataset is included with the supplementary information.

SUBMITTER: Foong J 

PROVIDER: S-EPMC4593641 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature


Publications

Prioritizing Clinically Relevant Copy Number Variation from Genetic Interactions and Gene Function Data.

Justin Foong, Marta Girdea, James Stavropoulos, Michael Brudno

PLoS ONE, 2015-10-05, 10



Similar Datasets

S-EPMC4532872 | biostudies-literature
S-EPMC6586881 | biostudies-literature
S-EPMC3158569 | biostudies-literature
2015-07-28 | GSE70374 | GEO
S-EPMC5460076 | biostudies-literature
2015-07-28 | E-GEOD-70374 | biostudies-arrayexpress
S-EPMC2920188 | biostudies-literature
S-EPMC3530597 | biostudies-literature
S-EPMC5605662 | biostudies-literature
S-EPMC3409265 | biostudies-literature