Unknown

Dataset Information

0

Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.


ABSTRACT: Recently, we have developed a coronavirus-specific gene-finding system, ZCURVE_CoV 1.0. In this paper, the system is further improved by taking the prediction of cleavage sites of viral proteinases in polyproteins into account. The cleavage sites of the 3C-like proteinase and papain-like proteinase are highly conserved. Based on the method of traditional positional weight matrix trained by the peptides around cleavage sites, the present method also sufficiently considers the length conservation of non-structural proteins cleaved by the 3C-like proteinase and papain-like proteinase to reduce the false positive prediction rate. The improved system, ZCURVE_CoV 2.0, has been run for each of the 24 completely sequenced coronavirus genomes in GenBank. Consequently, all the non-structural proteins in the 24 genomes are accurately predicted. Compared with known annotations, the performance of the present method is satisfactory. The software ZCURVE_CoV 2.0 is freely available at http://tubic.tju.edu.cn/sars/.

SUBMITTER: Gao F 

PROVIDER: S-EPMC7232748 | biostudies-literature | 2003 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Prediction of proteinase cleavage sites in polyproteins of coronaviruses and its applications in analyzing SARS-CoV genomes.

Gao Feng F   Ou Hong-Yu HY   Chen Ling-Ling LL   Zheng Wen-Xin WX   Zhang Chun-Ting CT  

FEBS letters 20031001 3


Recently, we have developed a coronavirus-specific gene-finding system, ZCURVE_CoV 1.0. In this paper, the system is further improved by taking the prediction of cleavage sites of viral proteinases in polyproteins into account. The cleavage sites of the 3C-like proteinase and papain-like proteinase are highly conserved. Based on the method of traditional positional weight matrix trained by the peptides around cleavage sites, the present method also sufficiently considers the length conservation  ...[more]

Similar Datasets

| S-EPMC7836551 | biostudies-literature
| S-EPMC8529245 | biostudies-literature
| S-EPMC442122 | biostudies-literature
| S-EPMC7134609 | biostudies-literature
| S-EPMC7833154 | biostudies-literature
| S-EPMC5407059 | biostudies-literature
| S-EPMC1764470 | biostudies-literature
| S-EPMC3767653 | biostudies-literature
| S-EPMC2893604 | biostudies-literature
| S-EPMC152112 | biostudies-literature