Unknown

Dataset Information

0

Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity.


ABSTRACT: The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the in vivo binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV's impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding.

SUBMITTER: Jin Y 

PROVIDER: S-EPMC8458901 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6375646 | biostudies-literature
| S-EPMC4006273 | biostudies-literature
| S-EPMC4618392 | biostudies-literature
| S-EPMC4838337 | biostudies-literature
| S-EPMC5694663 | biostudies-literature
2022-04-29 | GSE196451 | GEO
| S-EPMC5042832 | biostudies-literature
| S-EPMC6497270 | biostudies-literature
| S-EPMC9351228 | biostudies-literature
| S-EPMC5870879 | biostudies-literature