Perspectives on Codebook: sequence specificity of uncharacterized human transcription factors
Ontology highlight
ABSTRACT: We describe an effort (“Codebook”) to determine the sequence specificity of 332 putative and largely uncharacterized human transcription factors (TFs), as well as 61 control TFs. Nearly 5,000 independent experiments, including in vitro and in vivo assays, produced motifs for most of the uncharacterized TFs analyzed (180, or 53%), the vast majority of which are unique to a single TF. The data highlight the extensive contribution of transposable elements to TF evolution, both in cis and trans, and identify tens of thousands of conserved, base-level binding sites in the human genome. The use of multiple platforms provides an unprecedented opportunity to benchmark and analyze TF sequence specificity, function, and evolution, as further explored in accompanying manuscripts. Over 1,421 human TFs are now associated with a DNA binding motif. Extrapolation from the Codebook benchmarking suggests that many of the binding motifs for well-studied TFs may inaccurately describe the TF’s true sequence preferences.
ORGANISM(S): synthetic construct Homo sapiens
PROVIDER: GSE275577 | GEO | 2024/11/11
REPOSITORIES: GEO
ACCESS DATA