Dataset Information

ZCURVE_CoV: a new system to recognize protein coding genes in coronavirus genomes, and its applications in analyzing SARS-CoV genomes.


ABSTRACT: A new system to recognize protein coding genes in coronavirus genomes, especially suited to the SARS-CoV genomes, is proposed in this paper. Compared with some existing systems, the new program package has the merits of simplicity, high accuracy, reliability, and speed. The system, ZCURVE_CoV, has been run on each of the 11 newly sequenced SARS-CoV genomes. As a result, six previously unannotated genomes have been annotated, and some problems with the previous annotations of the remaining five genomes are pointed out and discussed. In addition to the polyprotein chain ORFs 1a and 1b and the four genes coding for the major structural proteins, spike (S), small envelope (E), membrane (M), and nucleocapsid (N), ZCURVE_CoV also predicts 5-6 putative proteins of unknown function, between 39 and 274 amino acids in length. Some single nucleotide mutations within these putative coding sequences have been detected, and their biological implications are discussed. A web service is provided through which a user can obtain the annotated result immediately by pasting SARS-CoV genome sequences into the input window on the web site (http://tubic.tju.edu.cn/sars/). The ZCURVE_CoV software can also be downloaded freely from the same address and run on computers under Windows or Linux.

SUBMITTER: Chen LL 

PROVIDER: S-EPMC7134609 | biostudies-literature | 2003 Jul

REPOSITORIES: biostudies-literature

Publications

ZCURVE_CoV: a new system to recognize protein coding genes in coronavirus genomes, and its applications in analyzing SARS-CoV genomes.

Chen Ling-Ling, Ou Hong-Yu, Zhang Ren, Zhang Chun-Ting

Biochemical and Biophysical Research Communications, 2003-07-01, issue 2


Similar Datasets

| S-EPMC7232748 | biostudies-literature
| S-EPMC9241832 | biostudies-literature
| S-EPMC7799330 | biostudies-literature
2010-06-05 | E-GEOD-546 | biostudies-arrayexpress
| S-EPMC7874498 | biostudies-literature
| S-EPMC9742375 | biostudies-literature
2022-03-08 | E-MTAB-11523 | biostudies-arrayexpress
2020-05-08 | GSE149973 | GEO
| S-EPMC152858 | biostudies-literature
| S-BSST379 | biostudies-other