Unknown

Dataset Information

0

ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci.


ABSTRACT: Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the 'Prioritization of candidate causal Genes at Molecular QTLs' (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of 'true positive' causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub.

SUBMITTER: Stacey D 

PROVIDER: S-EPMC6326795 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci.

Stacey David D   Fauman Eric B EB   Ziemek Daniel D   Sun Benjamin B BB   Harshfield Eric L EL   Wood Angela M AM   Butterworth Adam S AS   Suhre Karsten K   Paul Dirk S DS  

Nucleic acids research 20190101 1


Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the 'Prioritization of candidate causal Genes at Molecular QTLs' (ProGeM), which incorporates biological domain-specific annotatio  ...[more]

Similar Datasets

| S-EPMC4374467 | biostudies-literature
| S-EPMC8187656 | biostudies-literature
| S-EPMC10886872 | biostudies-literature
| S-EPMC7787474 | biostudies-literature
| S-EPMC8187908 | biostudies-literature
| S-EPMC4899527 | biostudies-other
| S-EPMC10706376 | biostudies-literature
| S-EPMC8440299 | biostudies-literature
| S-EPMC9870841 | biostudies-literature
| S-EPMC2034641 | biostudies-literature