Unknown

Dataset Information

0

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.


ABSTRACT: Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally.We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5-2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy.Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2.saulo.deoliveira@dtc.ox.ac.uk.Supplementary data are available at Bioinformatics online.

SUBMITTER: de Oliveira SHP 

PROVIDER: S-EPMC6030820 | biostudies-other | 2018 Apr

REPOSITORIES: biostudies-other

altmetric image

Publications

Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.

de Oliveira Saulo H P SHP   Law Eleanor C EC   Shi Jiye J   Deane Charlotte M CM  

Bioinformatics (Oxford, England) 20180401 7


<h4>Motivation</h4>Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally.<h4>Results</h4>We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo pr  ...[more]

Similar Datasets

| S-EPMC4371999 | biostudies-literature
| S-EPMC4406757 | biostudies-literature
| S-EPMC4987898 | biostudies-literature
| S-EPMC6153407 | biostudies-literature
| S-EPMC2648761 | biostudies-literature
| S-EPMC515325 | biostudies-literature
| S-EPMC3849216 | biostudies-literature
| S-EPMC5018369 | biostudies-literature
| S-EPMC3491489 | biostudies-literature
2015-03-03 | E-GEOD-59124 | biostudies-arrayexpress