Unknown

Dataset Information

0

Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.


ABSTRACT: With the avalanche of biological sequences generated in the post-genomic age, one of the most challenging problems in computational biology is how to effectively formulate the sequence of a biological sample (such as DNA, RNA or protein) with a discrete model or a vector that can effectively reflect its sequence pattern information or capture its key features concerned. Although several web servers and stand-alone tools were developed to address this problem, all these tools, however, can only handle one type of samples. Furthermore, the number of their built-in properties is limited, and hence it is often difficult for users to formulate the biological sequences according to their desired features or properties. In this article, with a much larger number of built-in properties, we are to propose a much more flexible web server called Pse-in-One (http://bioinformatics.hitsz.edu.cn/Pse-in-One/), which can, through its 28 different modes, generate nearly all the possible feature vectors for DNA, RNA and protein sequences. Particularly, it can also generate those feature vectors with the properties defined by users themselves. These feature vectors can be easily combined with machine-learning algorithms to develop computational predictors and analysis methods for various tasks in bioinformatics and system biology. It is anticipated that the Pse-in-One web server will become a very useful tool in computational proteomics, genomics, as well as biological sequence analysis. Moreover, to maximize users' convenience, its stand-alone version can also be downloaded from http://bioinformatics.hitsz.edu.cn/Pse-in-One/download/, and directly run on Windows, Linux, Unix and Mac OS.

SUBMITTER: Liu B 

PROVIDER: S-EPMC4489303 | biostudies-literature | 2015 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.

Liu Bin B   Liu Fule F   Wang Xiaolong X   Chen Junjie J   Fang Longyun L   Chou Kuo-Chen KC  

Nucleic acids research 20150509 W1


With the avalanche of biological sequences generated in the post-genomic age, one of the most challenging problems in computational biology is how to effectively formulate the sequence of a biological sample (such as DNA, RNA or protein) with a discrete model or a vector that can effectively reflect its sequence pattern information or capture its key features concerned. Although several web servers and stand-alone tools were developed to address this problem, all these tools, however, can only h  ...[more]

Similar Datasets

| S-EPMC4987903 | biostudies-literature
| S-EPMC3380018 | biostudies-literature
| S-EPMC310868 | biostudies-literature
| S-EPMC1538864 | biostudies-literature
| S-EPMC9252776 | biostudies-literature
| S-EPMC2744726 | biostudies-literature
| S-EPMC8275979 | biostudies-literature
| S-EPMC9252814 | biostudies-literature
| S-EPMC7595847 | biostudies-literature