Unknown

Dataset Information

0

Multi-instance multilabel learning with weak-label for predicting protein function in electricigens.


ABSTRACT: Nature often brings several domains together to form multidomain and multifunctional proteins with a vast number of possibilities. In our previous study, we disclosed that the protein function prediction problem is naturally and inherently Multi-Instance Multilabel (MIML) learning tasks. Automated protein function prediction is typically implemented under the assumption that the functions of labeled proteins are complete; that is, there are no missing labels. In contrast, in practice just a subset of the functions of a protein are known, and whether this protein has other functions is unknown. It is evident that protein function prediction tasks suffer from weak-label problem; thus protein function prediction with incomplete annotation matches well with the MIML with weak-label learning framework. In this paper, we have applied the state-of-the-art MIML with weak-label learning algorithm MIMLwel for predicting protein functions in two typical real-world electricigens organisms which have been widely used in microbial fuel cells (MFCs) researches. Our experimental results validate the effectiveness of MIMLwel algorithm in predicting protein functions with incomplete annotation.

SUBMITTER: Wu JS 

PROVIDER: S-EPMC4436452 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Multi-instance multilabel learning with weak-label for predicting protein function in electricigens.

Wu Jian-Sheng JS   Hu Hai-Feng HF   Yan Shan-Cheng SC   Tang Li-Hua LH  

BioMed research international 20150505


Nature often brings several domains together to form multidomain and multifunctional proteins with a vast number of possibilities. In our previous study, we disclosed that the protein function prediction problem is naturally and inherently Multi-Instance Multilabel (MIML) learning tasks. Automated protein function prediction is typically implemented under the assumption that the functions of labeled proteins are complete; that is, there are no missing labels. In contrast, in practice just a subs  ...[more]

Similar Datasets

| S-EPMC9083914 | biostudies-literature
| S-EPMC3951281 | biostudies-literature
| S-EPMC4697333 | biostudies-literature
| S-EPMC4634905 | biostudies-literature
| S-EPMC4212833 | biostudies-literature
| S-EPMC5374972 | biostudies-other
| S-EPMC5463197 | biostudies-other
| S-EPMC6349967 | biostudies-literature
| S-EPMC3374840 | biostudies-literature
| S-EPMC6203325 | biostudies-literature