Unknown

Dataset Information

0

EPIFANY: A Method for Efficient High-Confidence Protein Inference.


ABSTRACT: Accurate protein inference in the presence of shared peptides is still one of the key problems in bottom-up proteomics. Most protein inference tools employing simple heuristic inference strategies are efficient but exhibit reduced accuracy. More advanced probabilistic methods often exhibit better inference quality but tend to be too slow for large data sets. Here, we present a novel protein inference method, EPIFANY, combining a loopy belief propagation algorithm with convolution trees for efficient processing of Bayesian networks. We demonstrate that EPIFANY combines the reliable protein inference of Bayesian methods with significantly shorter runtimes. On the 2016 iPRG protein inference benchmark data, EPIFANY is the only tested method that finds all true-positive proteins at a 5% protein false discovery rate (FDR) without strict prefiltering on the peptide-spectrum match (PSM) level, yielding an increase in identification performance (+10% in the number of true positives and +14% in partial AUC) compared to previous approaches. Even very large data sets with hundreds of thousands of spectra (which are intractable with other Bayesian and some non-Bayesian tools) can be processed with EPIFANY within minutes. The increased inference quality including shared peptides results in better protein inference results and thus increased robustness of the biological hypotheses generated. EPIFANY is available as open-source software for all major platforms at https://OpenMS.de/epifany.

SUBMITTER: Pfeuffer J 

PROVIDER: S-EPMC7583457 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

EPIFANY: A Method for Efficient High-Confidence Protein Inference.

Pfeuffer Julianus J   Sachsenberg Timo T   Dijkstra Tjeerd M H TMH   Serang Oliver O   Reinert Knut K   Kohlbacher Oliver O  

Journal of proteome research 20200213 3


Accurate protein inference in the presence of shared peptides is still one of the key problems in bottom-up proteomics. Most protein inference tools employing simple heuristic inference strategies are efficient but exhibit reduced accuracy. More advanced probabilistic methods often exhibit better inference quality but tend to be too slow for large data sets. Here, we present a novel protein inference method, EPIFANY, combining a loopy belief propagation algorithm with convolution trees for effic  ...[more]

Similar Datasets

| S-EPMC3416704 | biostudies-literature
| S-EPMC5714343 | biostudies-literature
| S-EPMC9249175 | biostudies-literature
| S-EPMC7137106 | biostudies-literature
| S-EPMC3187655 | biostudies-literature
| S-EPMC3650325 | biostudies-literature
2017-04-01 | GSE97211 | GEO
| S-EPMC7453826 | biostudies-literature
| S-EPMC4053744 | biostudies-literature
| S-EPMC2563020 | biostudies-literature