Dataset Information

Intergenic disease-associated regions are abundant in novel transcripts.


ABSTRACT:

Background

Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored.

Results

To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression.

Conclusions

This resource of previously unreported transcripts in disease-associated regions (http://gwas-captureseq.dingerlab.org) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.

SUBMITTER: Bartonicek N 

PROVIDER: S-EPMC5747244 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

Similar Datasets

2017-07-04 | E-MTAB-4519 | biostudies-arrayexpress
| PRJEB21619 | ENA
| S-EPMC1370778 | biostudies-literature
2005-06-03 | E-GEOD-2347 | biostudies-arrayexpress
2005-06-03 | GSE2347 | GEO
| S-EPMC3412814 | biostudies-literature
| S-EPMC3681616 | biostudies-literature
| S-EPMC1800667 | biostudies-literature
| S-EPMC7605252 | biostudies-literature
| PRJEB44845 | ENA