Dataset Information

Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy.


ABSTRACT: Discovering protein function from novel primary sequences is essential. Wet-lab experimental procedures are both time-consuming and costly, so reliably predicting protein structure and function from the amino acid sequence alone has significant value. TATA-binding protein (TBP) is a DNA-binding protein that plays a key role in transcription regulation. Our study proposes an automatic approach for identifying TATA-binding proteins efficiently, accurately, and conveniently. The method could also guide the identification of other specific proteins with computational intelligence strategies. First, we propose novel fingerprint features for TBP based on pseudo amino acid composition, physicochemical properties, and secondary structure. Second, hierarchical feature dimensionality reduction strategies are employed to further improve performance. Pretata currently achieves 92.92% TATA-binding protein prediction accuracy, which is better than all other existing methods. The experiments demonstrate that our method can greatly improve prediction accuracy and speed, making large-scale NGS data prediction practical. A web server has been developed for other researchers and can be accessed at http://server.malab.cn/preTata/ .
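
To make the two steps named in the abstract more concrete, the following is a minimal sketch, not the authors' method. It computes plain amino acid composition (the simplest ingredient of pseudo amino acid composition) and then applies a naive variance filter as a stand-in for dimensionality reduction. The function names `aa_composition` and `drop_low_variance`, the threshold value, and the toy sequences are all assumptions made for illustration; the actual Pretata features and its hierarchical reduction strategy are not specified in this record.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def aa_composition(sequence):
    """Return the fraction of each standard amino acid in `sequence` (20 values)."""
    seq = sequence.upper()
    # Count only standard residues; anything else (e.g. 'X') is ignored.
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def drop_low_variance(rows, threshold=1e-4):
    """Keep only feature columns whose variance across rows exceeds `threshold`.

    This is a hypothetical stand-in for a real dimensionality reduction step.
    Returns (kept_column_indices, reduced_rows).
    """
    n = len(rows)
    keep = []
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        mean = sum(column) / n
        variance = sum((x - mean) ** 2 for x in column) / n
        if variance > threshold:
            keep.append(j)
    return keep, [[row[j] for j in keep] for row in rows]


# Toy sequences, made up for illustration (not real TBP sequences).
toy_sequences = ["MKVLAAGIK", "MDDQQRSTV", "GGGAAKKLL"]
features = [aa_composition(s) for s in toy_sequences]
kept, reduced = drop_low_variance(features)
print(len(features[0]), "features before,", len(kept), "after")
```

In practice the reduced feature vectors would be passed to a classifier trained on labeled TBP and non-TBP sequences; that step is omitted here because the record does not describe it.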

SUBMITTER: Zou Q 

PROVIDER: S-EPMC5259984 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

Publications

Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy.

Zou Q, Wan S, Ju Y, Tang J, Zeng X

BMC Systems Biology, 2016-12-23, Suppl 4


Similar Datasets

| S-EPMC3471470 | biostudies-literature
| S-EPMC9329371 | biostudies-literature
| S-EPMC5337902 | biostudies-literature
| S-EPMC7753969 | biostudies-literature
| S-EPMC8505167 | biostudies-literature
| S-EPMC8129083 | biostudies-literature
| S-EPMC9345488 | biostudies-literature
2014-10-10 | E-GEOD-38237 | biostudies-arrayexpress
| S-EPMC7291640 | biostudies-literature
| S-EPMC4498733 | biostudies-literature