Unknown

Dataset Information

0

IFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.


ABSTRACT: Summary:Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. Availability and implementation:http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Chen Z 

PROVIDER: S-EPMC6658705 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.

Chen Zhen Z   Zhao Pei P   Li Fuyi F   Leier André A   Marquez-Lago Tatiana T TT   Wang Yanan Y   Webb Geoffrey I GI   Smith A Ian AI   Daly Roger J RJ   Chou Kuo-Chen KC   Song Jiangning J  

Bioinformatics (Oxford, England) 20180701 14


<h4>Summary</h4>Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 majo  ...[more]

Similar Datasets

| S-EPMC4122144 | biostudies-literature
| S-EPMC3842755 | biostudies-literature
| S-EPMC5570166 | biostudies-literature
| S-EPMC7968678 | biostudies-literature
| S-EPMC9252776 | biostudies-literature
| S-EPMC6042728 | biostudies-literature
| S-EPMC2744726 | biostudies-literature
| S-EPMC9252814 | biostudies-literature
| S-EPMC8769707 | biostudies-literature
| S-EPMC6030929 | biostudies-literature