
Dataset Information


Allele frequencies of variants in ultra conserved elements identify selective pressure on transcription factor binding.


ABSTRACT: Ultra-conserved genes or elements (UCGs/UCEs) in the human genome are extreme examples of conservation. We characterized natural variation in 2884 UCEs and UCGs in two distinct populations, Singaporean Chinese (n = 280) and Italian (n = 501), using a pooled-sample, targeted-capture sequencing approach. In these regions we identify, with high confidence, an abundance of rare SNVs (MAF < 0.5%), 75% of which are not present in dbSNP137. UCE association studies for complex human traits can use this information to model the expected background variation and thus the power required for such studies. By combining our data with 1000 Genomes Project data, we show in three independent datasets that prevalent UCE variants (MAF > 5%) are more often found at relatively less-conserved nucleotides within UCEs than rare variants are. Moreover, prevalent variants are less likely to overlap transcription factor binding sites. Using SNPfold, we found no significant influence of RNA secondary structure on UCE conservation. Taken together, these results suggest that UCEs are not under selective pressure as a stretch of DNA but are under differential evolutionary pressure at the single-nucleotide level.
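The two frequency classes named in the abstract (rare, MAF < 0.5%; prevalent, MAF > 5%) can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the "intermediate" label for frequencies between the two thresholds, and the allele counts are all hypothetical, and the example assumes a pool of 280 diploid individuals, i.e. 560 chromosomes.

```python
# Thresholds taken from the abstract; everything else is illustrative.
RARE_MAX = 0.005       # rare: MAF < 0.5%
PREVALENT_MIN = 0.05   # prevalent: MAF > 5%


def minor_allele_frequency(alt_count: int, total_alleles: int) -> float:
    """Return the frequency of the less common allele at a biallelic site."""
    if total_alleles <= 0 or not 0 <= alt_count <= total_alleles:
        raise ValueError("counts must satisfy 0 <= alt_count <= total_alleles, total > 0")
    freq = alt_count / total_alleles
    return min(freq, 1.0 - freq)


def classify_maf(maf: float) -> str:
    """Bin a MAF into 'rare', 'prevalent', or a hypothetical 'intermediate' class."""
    if not 0.0 <= maf <= 0.5:
        raise ValueError("MAF must lie between 0 and 0.5")
    if maf < RARE_MAX:
        return "rare"
    if maf > PREVALENT_MIN:
        return "prevalent"
    return "intermediate"  # label not used in the source; assumed here


# Hypothetical alternate-allele counts out of 560 pooled chromosomes.
TOTAL_ALLELES = 2 * 280
example_counts = {"var_a": 1, "var_b": 20, "var_c": 200, "var_d": 500}

for name, count in example_counts.items():
    maf = minor_allele_frequency(count, TOTAL_ALLELES)
    print(f"{name}: MAF={maf:.4f} -> {classify_maf(maf)}")
```

Taking the minimum of the allele frequency and its complement keeps the classification independent of which allele is treated as the reference.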

SUBMITTER: Silla T 

PROVIDER: S-EPMC4219694 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature


Publications

Allele frequencies of variants in ultra conserved elements identify selective pressure on transcription factor binding.

Silla T, Kepp K, Tai ES, Goh L, Davila S, Catela Ivkovic T, Calin GA, Voorhoeve PM

PLoS One, 2014-11-04, issue 11



Similar Datasets

| S-EPMC3862641 | biostudies-literature
| S-EPMC11373383 | biostudies-literature
| S-EPMC6893198 | biostudies-literature
| S-EPMC10475782 | biostudies-literature
| S-EPMC9184574 | biostudies-literature
| S-EPMC2615030 | biostudies-literature
| S-EPMC6430334 | biostudies-literature
| S-EPMC3207674 | biostudies-literature
| S-EPMC10815393 | biostudies-literature
| S-EPMC9746316 | biostudies-literature