Dataset Information

High-throughput discovery of functional disordered regions: investigation of transactivation domains.


ABSTRACT: Over 40% of proteins in any eukaryotic genome encode intrinsically disordered regions (IDRs) that do not adopt defined tertiary structures. Certain IDRs perform critical functions, but discovering them is non-trivial as the biological context determines their function. We present IDR-Screen, a framework to discover functional IDRs in a high-throughput manner by simultaneously assaying large numbers of DNA sequences that code for short disordered sequences. Functionality-conferring patterns in their protein sequence are inferred through statistical learning. Using a yeast HSF1 transcription factor-based assay, we discovered IDRs that function as transactivation domains (TADs) by screening a random sequence library and a designed library consisting of variants of 13 diverse TADs. Using machine learning, we find that segments devoid of positively charged residues but with redundant short sequence patterns of negatively charged and aromatic residues are a generic feature for TAD functionality. We anticipate that investigating defined sequence libraries using IDR-Screen for specific functions can facilitate discovering novel and functional regions of the disordered proteome as well as understanding the impact of natural and disease variants in disordered segments.

SUBMITTER: Ravarani CN 

PROVIDER: S-EPMC5949888 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

Publications

High-throughput discovery of functional disordered regions: investigation of transactivation domains.

Ravarani CN, Erkina TY, De Baets G, Dudman DC, Erkine AM, Babu MM

Molecular Systems Biology, 2018-05-14, issue 5


Similar Datasets

2024-02-02 | GSE254492 | GEO
| S-EPMC4908364 | biostudies-literature
| S-EPMC3510515 | biostudies-literature
| S-EPMC6818164 | biostudies-literature
2018-07-10 | GSE114387 | GEO
| S-EPMC8508753 | biostudies-literature
| S-EPMC5570202 | biostudies-literature
| S-EPMC7238970 | biostudies-literature
| S-EPMC4876815 | biostudies-literature
| S-EPMC3760854 | biostudies-literature