Unknown

Dataset Information

0

PCP-ML: protein characterization package for machine learning.


ABSTRACT:

Background

Machine Learning (ML) has a number of demonstrated applications in protein prediction tasks such as protein structure prediction. To speed further development of machine learning based tools and their release to the community, we have developed a package which characterizes several aspects of a protein commonly used for protein prediction tasks with machine learning.

Findings

A number of software libraries and modules exist for handling protein related data. The package we present in this work, PCP-ML, is unique in its small footprint and emphasis on machine learning. Its primary focus is on characterizing various aspects of a protein through sets of numerical data. The generated data can then be used with machine learning tools and/or techniques. PCP-ML is very flexible in how the generated data is formatted and as a result is compatible with a variety of existing machine learning packages. Given its small size, it can be directly packaged and distributed with community developed tools for protein prediction tasks.

Conclusions

Source code and example programs are available under a BSD license at http://mlid.cps.cmich.edu/eickh1jl/tools/PCPML/. The package is implemented in C++ and accessible as a Python module.

SUBMITTER: Eickholt J 

PROVIDER: S-EPMC4246511 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

PCP-ML: protein characterization package for machine learning.

Eickholt Jesse J   Wang Zheng Z  

BMC research notes 20141118


<h4>Background</h4>Machine Learning (ML) has a number of demonstrated applications in protein prediction tasks such as protein structure prediction. To speed further development of machine learning based tools and their release to the community, we have developed a package which characterizes several aspects of a protein commonly used for protein prediction tasks with machine learning.<h4>Findings</h4>A number of software libraries and modules exist for handling protein related data. The package  ...[more]

Similar Datasets

| S-EPMC10130421 | biostudies-literature
| S-EPMC8139618 | biostudies-literature
| S-EPMC10422334 | biostudies-literature
| S-EPMC8213174 | biostudies-literature
| S-EPMC5467039 | biostudies-literature
| S-EPMC8600276 | biostudies-literature
| S-EPMC5832912 | biostudies-literature
| S-EPMC9900208 | biostudies-literature
2020-04-18 | GSE129474 | GEO