Unknown

Dataset Information

0

A novel method for functional annotation prediction based on combination of classification methods.


ABSTRACT: Automated protein function prediction defines the designation of functions of unknown protein functions by using computational methods. This technique is useful to automatically assign gene functional annotations for undefined sequences in next generation genome analysis (NGS). NGS is a popular research method since high-throughput technologies such as DNA sequencing and microarrays have created large sets of genes. These huge sequences have greatly increased the need for analysis. Previous research has been based on the similarities of sequences as this is strongly related to the functional homology. However, this study aimed to designate protein functions by automatically predicting the function of the genome by utilizing InterPro (IPR), which can represent the properties of the protein family and groups of the protein function. Moreover, we used gene ontology (GO), which is the controlled vocabulary used to comprehensively describe the protein function. To define the relationship between IPR and GO terms, three pattern recognition techniques have been employed under different conditions, such as feature selection and weighted value, instead of a binary one.

SUBMITTER: Jung J 

PROVIDER: S-EPMC4124759 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel method for functional annotation prediction based on combination of classification methods.

Jung Jaehee J   Lee Heung Ki HK   Yi Gangman G  

TheScientificWorldJournal 20140716


Automated protein function prediction defines the designation of functions of unknown protein functions by using computational methods. This technique is useful to automatically assign gene functional annotations for undefined sequences in next generation genome analysis (NGS). NGS is a popular research method since high-throughput technologies such as DNA sequencing and microarrays have created large sets of genes. These huge sequences have greatly increased the need for analysis. Previous rese  ...[more]

Similar Datasets

| S-EPMC2224172 | biostudies-literature
| S-EPMC6727554 | biostudies-literature
| S-EPMC4612221 | biostudies-literature
| S-EPMC3042383 | biostudies-literature
| S-EPMC6454616 | biostudies-literature
| S-EPMC3554231 | biostudies-literature
| S-EPMC8414800 | biostudies-literature
| S-EPMC3864256 | biostudies-literature