Unknown

Dataset Information

0

GEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.


ABSTRACT: In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp.

SUBMITTER: Nakagawa S 

PROVIDER: S-EPMC4885607 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.

Nakagawa So S   Takahashi Mahoko Ueda MU  

Database : the journal of biological databases and curation 20160530


In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes  ...[more]

Similar Datasets

| S-EPMC4897596 | biostudies-literature
| S-EPMC1347423 | biostudies-literature
| S-EPMC5210518 | biostudies-literature
| S-EPMC8201709 | biostudies-literature
| S-EPMC2987831 | biostudies-literature
| S-EPMC1395342 | biostudies-literature
| S-EPMC7156643 | biostudies-literature
| S-EPMC5452135 | biostudies-literature
| S-EPMC10675110 | biostudies-literature
| S-EPMC540043 | biostudies-literature