Unknown

Dataset Information

0

Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence.


ABSTRACT: Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of this human predictor arises from the clustering of aromatic, leucine, and acidic residues, which together are necessary for acidic activation domain function. When we combine our predictor with the predictions of convolutional neural network (CNN) models trained in yeast, the intersection is more accurate than individual models, emphasizing that each approach carries orthogonal information. We synthesize these findings into a new set of activation domain predictions on human transcription factors.

SUBMITTER: Kotha SR 

PROVIDER: S-EPMC10550315 | biostudies-literature | 2023 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence.

Kotha Sanjana R SR   Staller Max Valentín MV  

Genetics 20231001 2


Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of th  ...[more]

Similar Datasets

2021-12-08 | GSE190288 | GEO
| S-EPMC9241528 | biostudies-literature
| PRJNA786592 | ENA
| S-EPMC7763487 | biostudies-literature
| S-EPMC2254393 | biostudies-literature
| S-EPMC3985445 | biostudies-literature
| S-EPMC11642888 | biostudies-literature
| S-EPMC4599017 | biostudies-literature
| S-EPMC1221879 | biostudies-other
| S-EPMC11193542 | biostudies-literature