Unknown

Dataset Information

0

DeepSol: a deep learning framework for sequence-based protein solubility prediction.


ABSTRACT: Motivation:Protein solubility plays a vital role in pharmaceutical research and production yield. For a given protein, the extent of its solubility can represent the quality of its function, and is ultimately defined by its sequence. Thus, it is imperative to develop novel, highly accurate in silico sequence-based protein solubility predictors. In this work we propose, DeepSol, a novel Deep Learning-based protein solubility predictor. The backbone of our framework is a convolutional neural network that exploits k-mer structure and additional sequence and structural features extracted from the protein sequence. Results:DeepSol outperformed all known sequence-based state-of-the-art solubility prediction methods and attained an accuracy of 0.77 and Matthew's correlation coefficient of 0.55. The superior prediction accuracy of DeepSol allows to screen for sequences with enhanced production capacity and can more reliably predict solubility of novel proteins. Availability and implementation:DeepSol's best performing models and results are publicly deposited at https://doi.org/10.5281/zenodo.1162886 (Khurana and Mall, 2018). Supplementary information:Supplementary data are available at Bioinformatics online.

SUBMITTER: Khurana S 

PROVIDER: S-EPMC6355112 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

DeepSol: a deep learning framework for sequence-based protein solubility prediction.

Khurana Sameer S   Rawi Reda R   Kunji Khalid K   Chuang Gwo-Yu GY   Bensmail Halima H   Mall Raghvendra R  

Bioinformatics (Oxford, England) 20180801 15


<h4>Motivation</h4>Protein solubility plays a vital role in pharmaceutical research and production yield. For a given protein, the extent of its solubility can represent the quality of its function, and is ultimately defined by its sequence. Thus, it is imperative to develop novel, highly accurate in silico sequence-based protein solubility predictors. In this work we propose, DeepSol, a novel Deep Learning-based protein solubility predictor. The backbone of our framework is a convolutional neur  ...[more]

Similar Datasets

| S-EPMC5445391 | biostudies-literature
| S-EPMC10627365 | biostudies-literature
| S-EPMC9046255 | biostudies-literature
| S-EPMC6031027 | biostudies-literature
| S-EPMC9096921 | biostudies-literature
| S-EPMC9535432 | biostudies-literature
| S-EPMC8443569 | biostudies-literature
| S-EPMC7959776 | biostudies-literature
| S-EPMC6129267 | biostudies-literature
2023-03-31 | GSE165175 | GEO