
Dataset Information


Integrated entropy-based approach for analyzing exons and introns in DNA sequences.


ABSTRACT: BACKGROUND: Numerous essential algorithms and methods, including entropy-based quantitative methods, have been developed over the last decade to analyze complex DNA sequences. Exons and introns are the most notable components of DNA, and their identification and prediction remain a focus of state-of-the-art research. RESULTS: In this study, we designed an integrated entropy-based analysis approach that combines a modified topological entropy calculation, a genomic signal processing (GSP) method, and singular value decomposition (SVD) to investigate exons and introns in DNA sequences. We optimized and implemented the topological entropy and the generalized topological entropy to calculate the complexity of DNA sequences, highlighting the characteristics of repetitive sequences. Comparing the digitized entropy values of exons and introns, we observed that they differ significantly. After converting DNA data to numerical topological entropy values, we applied SVD to investigate exon and intron regions on a single gene sequence. Additionally, we used several genes from five species for exon prediction. CONCLUSIONS: Our approach not only helps to explore the complexity of DNA sequences and their functional elements, but also provides an entropy-based GSP method for analyzing exon and intron regions. Our work is feasible across different species and can be extended to analyze other components in both coding and noncoding regions of DNA sequences.
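To make the entropy step concrete, the following is a minimal sketch of the standard topological entropy of a finite DNA sequence (the commonly cited Koslicki-style definition), not the modified or generalized variants the abstract refers to, whose details are not given in this record. The function name `topological_entropy` and the choice to use the first 4^n + n - 1 bases as the window are assumptions made for illustration.

```python
import math


def topological_entropy(seq: str) -> float:
    """Standard topological entropy of a DNA string (illustrative sketch).

    Assumption: this is the textbook definition, not the authors'
    modified version. Pick the largest n with 4**n + n - 1 <= len(seq),
    keep the first 4**n + n - 1 bases, count the distinct length-n
    subwords, and return log_4(count) / n. The result lies in [0, 1].
    """
    seq = seq.upper()

    # Find the largest subword length n that the sequence can support.
    n = 0
    while 4 ** (n + 1) + n <= len(seq):  # 4**(n+1) + (n+1) - 1
        n += 1
    if n == 0:
        raise ValueError("sequence must contain at least 4 bases")

    # A window of this length holds exactly 4**n overlapping n-mers,
    # so a maximally complex sequence can reach entropy 1.
    window = seq[: 4 ** n + n - 1]
    subwords = {window[i:i + n] for i in range(len(window) - n + 1)}

    return math.log(len(subwords), 4) / n
```

A highly repetitive sequence such as a run of a single base yields an entropy of 0, while a window in which every possible n-mer occurs yields a value close to 1. Computing this value over sliding windows along a gene would give a numerical signal of the kind the abstract describes as input to SVD, although the windowing scheme used by the authors is not stated here.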

SUBMITTER: Li J 

PROVIDER: S-EPMC6557737 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature


Publications

Integrated entropy-based approach for analyzing exons and introns in DNA sequences.

Li Junyi, Zhang Li, Li Huinian, Ping Yuan, Xu Qingzhe, Wang Rongjie, Tan Renjie, Wang Zhen, Liu Bo, Wang Yadong

BMC Bioinformatics, 2019-06-10, Suppl 8



Similar Datasets

| S-EPMC3922877 | biostudies-literature
| S-EPMC4331808 | biostudies-literature
| S-EPMC20846 | biostudies-literature
| S-EPMC8002178 | biostudies-literature
| S-EPMC4745173 | biostudies-literature
| S-EPMC5029481 | biostudies-literature
| S-EPMC2648722 | biostudies-literature
| S-EPMC6035042 | biostudies-literature
| S-EPMC7334200 | biostudies-literature
| S-EPMC8682797 | biostudies-literature