Dataset Information

Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity.


ABSTRACT: SARS-CoV-2 belongs to the coronavirus family. Comparing the genomic features of viral genomes across the coronavirus family can improve our understanding of SARS-CoV-2. Here we present the first pan-genome analysis of 3,932 whole genomes from 101 species across 4 genera of the coronavirus family. We found a total of 181 genes in the pan-genome of the coronavirus family, of which only 3, the S, M and N genes, are highly conserved. We also constructed a pan-genome from 23,539 whole genomes of SARS-CoV-2. The SARS-CoV-2 pan-genome contains 13 genes in total, all of which are core genes. The pan-genome of coronaviruses shows a lower level of diversity than the pan-genomes of other RNA viruses, which contain no core genes. The three genes that are highly conserved in the coronavirus family, which are also core genes in the SARS-CoV-2 pan-genome, could be potential targets for developing nucleic acid diagnostic reagents with a decreased possibility of cross-reaction with other coronavirus species.
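
The core-gene idea in the abstract can be illustrated with a minimal sketch of gene presence/absence classification. This is not the authors' pipeline: the function name `classify_genes`, the `core_fraction` threshold, and the toy genomes below are all assumptions made for illustration, and the record does not state the cutoff the study used for "highly conserved".

```python
from typing import Dict, List, Set


def classify_genes(
    presence: Dict[str, Set[str]], core_fraction: float = 1.0
) -> Dict[str, List[str]]:
    """Split a pan-genome into core and dispensable genes.

    presence maps a genome name to the set of genes detected in it.
    A gene counts as core when it is present in at least core_fraction
    of the genomes (1.0 means present in every genome). The threshold
    is a hypothetical parameter, not a value taken from the study.
    """
    n_genomes = len(presence)
    if n_genomes == 0:
        return {"core": [], "dispensable": []}

    # The pan-genome is the union of all genes seen in any genome.
    pan_genome: Set[str] = set().union(*presence.values())

    # Count in how many genomes each gene is present.
    counts = {
        gene: sum(gene in genes for genes in presence.values())
        for gene in pan_genome
    }

    core = sorted(g for g, c in counts.items() if c / n_genomes >= core_fraction)
    dispensable = sorted(pan_genome - set(core))
    return {"core": core, "dispensable": dispensable}


# Toy presence/absence data, invented for illustration only.
toy = {
    "genome_A": {"S", "M", "N", "E"},
    "genome_B": {"S", "M", "N", "ORF_x"},
    "genome_C": {"S", "M", "N", "E", "ORF_y"},
}

result = classify_genes(toy)
print(result["core"])         # ['M', 'N', 'S']
print(result["dispensable"])  # ['E', 'ORF_x', 'ORF_y']
```

Lowering `core_fraction` (for example to 0.95) would treat genes missing from a few genomes as core, which is one common way to tolerate incomplete assemblies.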

SUBMITTER: Jiao D 

PROVIDER: S-EPMC8495401 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

Publications

Gene Presence/Absence Variation analysis of coronavirus family displays its pan-genomic diversity.

Jiao Du, Dong Xiaorui, Yu Yingyan, Wei Chaochun

International Journal of Biological Sciences, 2021-08-27, issue 14


Similar Datasets

| S-EPMC7653742 | biostudies-literature
| S-EPMC10990425 | biostudies-literature
| S-EPMC10138031 | biostudies-literature
| S-EPMC9949116 | biostudies-literature
| S-EPMC10273549 | biostudies-literature
| S-EPMC5053417 | biostudies-literature
| S-EPMC3433342 | biostudies-literature
| S-EPMC7057980 | biostudies-literature
| S-EPMC3879440 | biostudies-literature
| S-EPMC10514466 | biostudies-literature