Dataset Information

PiPred - a deep-learning method for prediction of π-helices in protein sequences.

ABSTRACT: Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d . A standalone version is available for download at: https://github.com/labstructbioinf/PiPred , where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices.

SUBMITTER: Ludwiczak J

PROVIDER: S-EPMC6499831 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

PiPred - a deep-learning method for prediction of π-helices in protein sequences.

Ludwiczak Jan J Winski Aleksander A da Silva Neto Antonio Marinho AM Szczepaniak Krzysztof K Alva Vikram V Dunin-Horkawicz Stanislaw S

Scientific reports 20190503 1

Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for ...[more]

PMID: 31053765

Dataset Information

PiPred - a deep-learning method for prediction of π-helices in protein sequences.

Publications

PiPred - a deep-learning method for prediction of π-helices in protein sequences.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Protein Secondary Structure Prediction With a Reductive Deep Learning Method.
| S-EPMC8240957 | biostudies-literature

DL-PPI: a method on prediction of sequenced protein-protein interaction based on deep learning.
| S-EPMC10722729 | biostudies-literature

DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences.
| S-EPMC6594651 | biostudies-literature

Characterizing Promoter and Enhancer Sequences by a Deep Learning Method.
| S-EPMC8239401 | biostudies-literature

CoCoNat: A Deep Learning–Based Tool for the Prediction of Coiled-coil Domains in Protein Sequences
| S-EPMC10883893 | biostudies-literature

DeepM6ASeq: prediction and characterization of m6A-containing sequences using deep learning.
| S-EPMC6311933 | biostudies-literature

EFG-CS: Predicting chemical shifts from amino acid sequences with protein structure prediction using machine learning and deep learning models.
| S-EPMC11232051 | biostudies-literature

DeepDISE: DNA Binding Site Prediction Using a Deep Learning Method.
| S-EPMC8197219 | biostudies-literature

Training deep learning models on personalized genomic sequences improves variant effect prediction.
| S-EPMC11507713 | biostudies-literature

Rapid protein stability prediction using deep learning representations.
| S-EPMC10266766 | biostudies-literature