Dataset Information

Prioritizing genes for X-linked diseases using population exome data.

ABSTRACT: Many new disease genes can be identified through high-throughput sequencing. Yet, variant interpretation for the large amounts of genomic data remains a challenge given variation of uncertain significance and genes that lack disease annotation. As clinically significant disease genes may be subject to negative selection, we developed a prediction method that measures paucity of non-synonymous variation in the human population to infer gene-based pathogenicity. Integrating human exome data of over 6000 individuals from the NHLBI Exome Sequencing Project, we tested the utility of the prediction method based on the ratio of non-synonymous to synonymous substitution rates (dN/dS) on X-chromosome genes. A low dN/dS ratio characterized genes associated with childhood disease and outcome. Furthermore, we identify new candidates for diseases with early mortality and demonstrate intragenic localized patterns of variants that suggest pathogenic hotspots. Our results suggest that intrahuman substitution analysis is a valuable tool to help prioritize novel disease genes in sequence interpretation.
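The dN/dS ratio named in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name, the per-gene count inputs, the gene labels, and the numbers below are all hypothetical, and the sketch assumes the simplest per-site normalization (variants divided by available sites) with no correction for multiple hits or mutation-rate bias.

```python
# Minimal sketch of a per-gene dN/dS calculation (hypothetical names and data).
# Assumption: dN and dS are approximated as variant counts per available site.

def dn_ds_ratio(nonsyn_variants, syn_variants, nonsyn_sites, syn_sites):
    """Return dN/dS from per-gene counts, or None when dS is zero."""
    if nonsyn_sites <= 0 or syn_sites <= 0:
        raise ValueError("site counts must be positive")
    dn = nonsyn_variants / nonsyn_sites  # non-synonymous rate per site
    ds = syn_variants / syn_sites        # synonymous rate per site
    if ds == 0:
        return None  # ratio is undefined without synonymous variation
    return dn / ds


# Hypothetical counts: (nonsyn variants, syn variants, nonsyn sites, syn sites)
genes = {
    "GENE_A": (4, 10, 900.0, 300.0),
    "GENE_B": (30, 10, 900.0, 300.0),
}

# Score every gene, drop undefined ratios, and rank lowest dN/dS first.
scores = {name: dn_ds_ratio(*counts) for name, counts in genes.items()}
ranked = sorted(
    ((name, ratio) for name, ratio in scores.items() if ratio is not None),
    key=lambda item: item[1],
)

for name, ratio in ranked:
    print(f"{name}\t{ratio:.3f}")
```

Under this sketch, genes with the lowest ratio appear first, matching the abstract's premise that a paucity of non-synonymous variation may point to genes under negative selection.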

SUBMITTER: Ge X

PROVIDER: S-EPMC4291241 | biostudies-literature | 2015 Feb

REPOSITORIES: biostudies-literature

Publications

Prioritizing genes for X-linked diseases using population exome data.

Ge Xiaoyan, Kwok Pui-Yan, Shieh Joseph T C

Human Molecular Genetics, 2014-09-12, issue 3

PMID: 25217573

Similar Datasets

Prioritizing Parkinson's disease genes using population-scale transcriptomic data.
| S-EPMC6397174 | biostudies-literature

driveR: a novel method for prioritizing cancer driver genes using somatic genomics data.
| S-EPMC8142487 | biostudies-literature

Prioritizing Autism Risk Genes using Personalized Graphical Models Estimated from Single Cell RNA-seq Data.
| S-EPMC9070996 | biostudies-literature

Prioritizing causal disease genes using unbiased genomic features.
| S-EPMC4279789 | biostudies-literature

Prioritizing candidate genes post-GWAS using multiple sources of data for mastitis resistance in dairy cattle.
| S-EPMC6127918 | biostudies-literature

Prioritizing disease-linked variants, genes, and pathways with an interactive whole-genome analysis pipeline.
| S-EPMC4130156 | biostudies-literature

Network-based integration of multi-omics data for prioritizing cancer genes.
| S-EPMC6041755 | biostudies-literature

PanDrugs2: prioritizing cancer therapies using integrated individual multi-omics data.
| S-EPMC10320188 | biostudies-literature

Reference exome data for a Northern Brazilian population.
| S-EPMC7578642 | biostudies-literature

A pipeline combining multiple strategies for prioritizing heterozygous variants for the identification of candidate genes in exome datasets.
| S-EPMC5441048 | biostudies-literature