Unknown

Dataset Information

0

Evolutionary changes in vertebrate genome signatures with special focus on coelacanth.


ABSTRACT: With a remarkable increase in genomic sequence data of a wide range of species, novel tools are needed for comprehensive analyses of the big sequence data. Self-organizing map (SOM) is a powerful tool for clustering high-dimensional data on one plane. For oligonucleotide compositions handled as high-dimensional data, we have previously modified the conventional SOM for genome informatics: BLSOM. In the present study, we constructed BLSOMs for oligonucleotide compositions in fragment sequences (e.g. 100 kb) from a wide range of vertebrates, including coelacanth, and found that the sequences were clustered primarily according to species without species information. As one of the nearest living relatives of tetrapod ancestors, coelacanth is believed to provide access to the phenotypic and genomic transitions leading to the emergence of tetrapods. The characteristic oligonucleotide composition found for coelacanth was connected with the lowest dinucleotide CG occurrence (i.e. the highest CG suppression) among fishes, which was rather equivalent to that of tetrapods. This evident CG suppression in coelacanth should reflect molecular evolutionary processes of epigenetic systems including DNA methylation during vertebrate evolution. Sequence of a de novo DNA methylase (Dntm3a) of coelacanth was found to be more closely related to that of tetrapods than that of other fishes.

SUBMITTER: Iwasaki Y 

PROVIDER: S-EPMC4195492 | biostudies-literature | 2014 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Evolutionary changes in vertebrate genome signatures with special focus on coelacanth.

Iwasaki Yuki Y   Abe Takashi T   Okada Norihiro N   Wada Kennosuke K   Wada Yoshiko Y   Ikemura Toshimichi T  

DNA research : an international journal for rapid publication of reports on genes and genomes 20140506 5


With a remarkable increase in genomic sequence data of a wide range of species, novel tools are needed for comprehensive analyses of the big sequence data. Self-organizing map (SOM) is a powerful tool for clustering high-dimensional data on one plane. For oligonucleotide compositions handled as high-dimensional data, we have previously modified the conventional SOM for genome informatics: BLSOM. In the present study, we constructed BLSOMs for oligonucleotide compositions in fragment sequences (e  ...[more]

Similar Datasets

| S-EPMC534663 | biostudies-literature
| S-EPMC3787270 | biostudies-literature
| S-EPMC4641402 | biostudies-literature
| S-EPMC2315648 | biostudies-literature
| S-EPMC5597290 | biostudies-literature
| S-EPMC5018878 | biostudies-literature
| S-EPMC10927551 | biostudies-literature
| S-EPMC3734258 | biostudies-literature
| S-EPMC3591787 | biostudies-literature
| S-EPMC7760388 | biostudies-literature