Dataset Information

MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis.

ABSTRACT:

Background

Protein structure data in the Protein Data Bank (PDB) are widely used in studies of protein function and evolution and in protein structure prediction. However, there are two main barriers to large-scale use of PDB data: 1) PDB data are highly redundant in terms of sequence and structure similarity; and 2) many PDB files have problems caused by inconsistent data and standards as well as missing residues, so automated retrieval and analysis are often difficult.

Description

To address these issues, we have created MUFOLD-DB (http://mufold.org/mufolddb.php), a web-based database that collects and processes the weekly PDB files, thereby providing users with non-redundant, cleaned and partially predicted structure data. For each of the non-redundant sequences, we annotate the SCOP domain classification and predict structures of missing regions by loop modelling. In addition, evolutionary information, secondary structure, disordered regions, and the processed three-dimensional structure are computed and visualized to help users better understand the protein.
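
The abstract does not say how missing regions are located before loop modelling. As a purely illustrative sketch, and not MUFOLD-DB's actual procedure, the Python snippet below finds gaps in a chain's residue numbering. It assumes residues are identified by plain integers that should run consecutively; real PDB files may use insertion codes or non-sequential numbering, which this sketch ignores. The function name `find_missing_ranges` and the sample data are hypothetical.

```python
from typing import Iterable, List, Tuple


def find_missing_ranges(observed: Iterable[int]) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) ranges of residue numbers that are
    absent between the first and last observed residue.

    Assumption: residue numbers are plain integers that should be
    consecutive; insertion codes and renumbered chains are not handled.
    """
    numbers = sorted(set(observed))
    gaps = []
    # Compare each observed residue number with the next one.
    for prev, curr in zip(numbers, numbers[1:]):
        if curr - prev > 1:
            gaps.append((prev + 1, curr - 1))
    return gaps


# Hypothetical chain with residues 5-8 and 12-14 unresolved.
observed_residues = [1, 2, 3, 4, 9, 10, 11, 15]
print(find_missing_ranges(observed_residues))  # [(5, 8), (12, 14)]
```

Each reported range marks a stretch that a loop-modelling step would need to fill; residues missing at either end of the chain cannot be detected from numbering alone and would require comparison against the full sequence.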

Conclusions

MUFOLD-DB integrates processed PDB sequence and structure data with multiple computational results, provides a friendly interface for users to retrieve, browse and download these data, and offers several functions that make the data easier to work with.

SUBMITTER: He Z

PROVIDER: S-EPMC4304177 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

Publications

MUFOLD-DB: a processed protein structure database for protein structure prediction and analysis.

Zhiquan He, Chao Zhang, Yang Xu, Shuai Zeng, Jingfen Zhang, Dong Xu

BMC Genomics, 2014-12-16

PMID: 25559128

Similar Datasets

Microarray meta-analysis database (M(2)DB): a uniformly pre-processed, quality controlled, and manually curated human clinical microarray database.
| S-EPMC2928207 | biostudies-literature

NSort/DB: an intranuclear compartment protein database.
| S-EPMC5054713 | biostudies-literature

THE-DB: a threading model database for comparative protein structure analysis of the E. coli K12 and human proteomes.
| S-EPMC6146127 | biostudies-literature

P(3)DB: An Integrated Database for Plant Protein Phosphorylation.
| S-EPMC3435559 | biostudies-literature

The iPPI-DB initiative: a community-centered database of protein-protein interaction modulators.
| S-EPMC8034526 | biostudies-literature

Database of RNA binding protein expression and disease dynamics (READ DB).
| S-EPMC4515031 | biostudies-literature

PCRPi-DB: a database of computationally annotated hot spots in protein interfaces.
| S-EPMC3013674 | biostudies-literature

3DCONS-DB: A Database of Position-Specific Scoring Matrices in Protein Structures.
| S-EPMC6149929 | biostudies-literature

NemChR-DB: a database of parasitic nematode chemosensory G-protein coupled receptors.
| S-EPMC8035219 | biostudies-literature

Decoys 'R' Us: a database of incorrect conformations to improve protein structure prediction.
| S-EPMC2144680 | biostudies-other