Dataset Information

A novel method for identifying key genes in macroevolution based on deep learning with attention mechanism.


ABSTRACT: Macroevolution can be regarded as the result of evolutionary changes in synergistically acting genes. Unfortunately, the importance of these genes in macroevolution is difficult to assess, and hence the identification of macroevolutionary key genes is a major challenge in evolutionary biology. In this study, we designed various word embedding libraries, drawing on natural language processing (NLP), that take into account the multiple mechanisms of evolutionary genomics. A novel method (IKGM) based on three types of attention mechanisms (domain attention, kmer attention and fused attention) was proposed to calculate the weights of different genes in macroevolution. Taking 34 species of diurnal butterflies and nocturnal moths in Lepidoptera as an example, we identified a few key genes with high weights, which were annotated with functions related to circadian rhythms, sensory organs, and behavioral habits. This study not only provides a novel method to identify the key genes of macroevolution at the genomic level, but also helps us to understand the microevolution mechanisms of diurnal butterflies and nocturnal moths in Lepidoptera.
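
The abstract names two ideas that a small example can make concrete: treating a sequence as overlapping k-mer "words", and turning per-gene scores into normalized attention-style weights. The sketch below is illustrative only and is not the IKGM implementation; the function names, the value k=3, the toy sequence, and the gene scores are all hypothetical, and a plain softmax stands in for the learned attention layers described in the paper.

```python
import math


def kmer_tokens(seq, k=3):
    """Split a sequence into overlapping k-mers, the 'words' fed to an embedding."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def softmax(scores):
    """Numerically stable softmax over a list of raw scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(gene_scores):
    """Map raw per-gene scores to weights that sum to 1 (hypothetical stand-in)."""
    names = list(gene_scores)
    weights = softmax([gene_scores[name] for name in names])
    return dict(zip(names, weights))


# Toy inputs, invented for illustration; not data from the study.
tokens = kmer_tokens("ATGCGT", k=3)
weights = attention_weights({"geneA": 2.0, "geneB": 0.5, "geneC": 0.5})

# Genes ranked from highest to lowest weight.
ranked = sorted(weights, key=weights.get, reverse=True)
```

In this toy setup, a gene with a higher raw score receives a larger share of the total weight, which is the sense in which "key genes with high weights" can be ranked.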

SUBMITTER: Mao J 

PROVIDER: S-EPMC10643560 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

Publications

A novel method for identifying key genes in macroevolution based on deep learning with attention mechanism.

Mao Jiawei, Cao Yong, Zhang Yan, Huang Biaosheng, Zhao Youjie

Scientific Reports, 2023 Nov 13, issue 1


Similar Datasets

2025-03-16 | GSE262245 | GEO
| S-EPMC10301803 | biostudies-literature
| S-EPMC7215070 | biostudies-literature
| S-EPMC11440089 | biostudies-literature
| S-EPMC8379737 | biostudies-literature
| S-EPMC7125446 | biostudies-literature
| S-EPMC11359982 | biostudies-literature
| S-EPMC10094198 | biostudies-literature
| S-EPMC9313278 | biostudies-literature
| S-EPMC7994531 | biostudies-literature