Unknown

Dataset Information

0

Prioritizing genes for X-linked diseases using population exome data.


ABSTRACT: Many new disease genes can be identified through high-throughput sequencing. Yet, variant interpretation for the large amounts of genomic data remains a challenge given variation of uncertain significance and genes that lack disease annotation. As clinically significant disease genes may be subject to negative selection, we developed a prediction method that measures paucity of non-synonymous variation in the human population to infer gene-based pathogenicity. Integrating human exome data of over 6000 individuals from the NHLBI Exome Sequencing Project, we tested the utility of the prediction method based on the ratio of non-synonymous to synonymous substitution rates (dN/dS) on X-chromosome genes. A low dN/dS ratio characterized genes associated with childhood disease and outcome. Furthermore, we identify new candidates for diseases with early mortality and demonstrate intragenic localized patterns of variants that suggest pathogenic hotspots. Our results suggest that intrahuman substitution analysis is a valuable tool to help prioritize novel disease genes in sequence interpretation.

SUBMITTER: Ge X 

PROVIDER: S-EPMC4291241 | biostudies-literature | 2015 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prioritizing genes for X-linked diseases using population exome data.

Ge Xiaoyan X   Kwok Pui-Yan PY   Shieh Joseph T C JT  

Human molecular genetics 20140912 3


Many new disease genes can be identified through high-throughput sequencing. Yet, variant interpretation for the large amounts of genomic data remains a challenge given variation of uncertain significance and genes that lack disease annotation. As clinically significant disease genes may be subject to negative selection, we developed a prediction method that measures paucity of non-synonymous variation in the human population to infer gene-based pathogenicity. Integrating human exome data of ove  ...[more]

Similar Datasets

| S-EPMC6397174 | biostudies-literature
| S-EPMC8142487 | biostudies-literature
| S-EPMC4279789 | biostudies-literature
| S-EPMC6127918 | biostudies-literature
| S-EPMC6041755 | biostudies-literature
| S-EPMC4130156 | biostudies-literature
| S-EPMC7578642 | biostudies-literature
| S-EPMC4046670 | biostudies-literature
| S-EPMC5441048 | biostudies-literature
| S-EPMC3129253 | biostudies-literature