Unknown

Dataset Information

0

GIBA: a clustering tool for detecting protein complexes.


ABSTRACT:

Background

During the last years, high throughput experimental methods have been developed which generate large datasets of protein - protein interactions (PPIs). However, due to the experimental methodologies these datasets contain errors mainly in terms of false positive data sets and reducing therefore the quality of any derived information. Typically these datasets can be modeled as graphs, where vertices represent proteins and edges the pairwise PPIs, making it easy to apply automated clustering methods to detect protein complexes or other biological significant functional groupings.

Methods

In this paper, a clustering tool, called GIBA (named by the first characters of its developers' nicknames), is presented. GIBA implements a two step procedure to a given dataset of protein-protein interaction data. First, a clustering algorithm is applied to the interaction data, which is then followed by a filtering step to generate the final candidate list of predicted complexes.

Results

The efficiency of GIBA is demonstrated through the analysis of 6 different yeast protein interaction datasets in comparison to four other available algorithms. We compared the results of the different methods by applying five different performance measurement metrices. Moreover, the parameters of the methods that constitute the filter have been checked on how they affect the final results.

Conclusion

GIBA is an effective and easy to use tool for the detection of protein complexes out of experimentally measured protein - protein interaction networks. The results show that GIBA has superior prediction accuracy than previously published methods.

SUBMITTER: Moschopoulos CN 

PROVIDER: S-EPMC2697634 | biostudies-literature | 2009 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

GIBA: a clustering tool for detecting protein complexes.

Moschopoulos Charalampos N CN   Pavlopoulos Georgios A GA   Schneider Reinhard R   Likothanassis Spiridon D SD   Kossida Sophia S  

BMC bioinformatics 20090616


<h4>Background</h4>During the last years, high throughput experimental methods have been developed which generate large datasets of protein - protein interactions (PPIs). However, due to the experimental methodologies these datasets contain errors mainly in terms of false positive data sets and reducing therefore the quality of any derived information. Typically these datasets can be modeled as graphs, where vertices represent proteins and edges the pairwise PPIs, making it easy to apply automat  ...[more]

Similar Datasets

| S-EPMC3958373 | biostudies-literature
| S-EPMC5773919 | biostudies-literature
| S-EPMC441501 | biostudies-literature
| S-EPMC8734265 | biostudies-literature
| S-EPMC3543700 | biostudies-literature
| S-EPMC4288635 | biostudies-literature
| S-EPMC1366636 | biostudies-literature
| S-EPMC3267700 | biostudies-literature
| S-EPMC3852146 | biostudies-literature
| S-EPMC4671556 | biostudies-literature