Dataset Information

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.

ABSTRACT: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5-2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. Contact: saulo.deoliveira@dtc.ox.ac.uk. Supplementary data are available at Bioinformatics online.

SUBMITTER: de Oliveira SHP

PROVIDER: S-EPMC6030820 | biostudies-other | 2018 Apr

REPOSITORIES: biostudies-other

Publications

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.

Saulo H P de Oliveira, Eleanor C Law, Jiye Shi, Charlotte M Deane

Bioinformatics (Oxford, England), 2018-04-01, issue 7

PMID: 29136098

Similar Datasets

Sequential de novo centromere formation and inactivation on a chromosomal fragment in maize.
| S-EPMC4371999 | biostudies-literature

Building a better fragment library for de novo protein structure prediction.
| S-EPMC4406757 | biostudies-literature

PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex.
| S-EPMC4987898 | biostudies-literature

Crius: A novel fragment-based algorithm of de novo substrate prediction for enzymes.
| S-EPMC6153407 | biostudies-literature

HHMMiR: efficient de novo prediction of microRNAs using hierarchical hidden Markov models.
| S-EPMC2648761 | biostudies-literature

De novo repeat classification and fragment assembly.
| S-EPMC515325 | biostudies-literature

The dual role of fragments in fragment-assembly methods for de novo protein structure prediction.
| S-EPMC3849216 | biostudies-literature

UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling.
| S-EPMC5018369 | biostudies-literature

De novo prediction of protein folding pathways and structure using the principle of sequential stabilization.
| S-EPMC3491489 | biostudies-literature

Sequential centromere shift via de novo formation in maize
2015-03-03 | E-GEOD-59124 | biostudies-arrayexpress