
Dataset Information

iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.


ABSTRACT: 5hmC, 6mA, and 4mC are three common DNA modifications and are involved in various biological processes. Accurate genome-wide identification of these sites is invaluable for better understanding their biological functions. Because experimental methods are labor-intensive and expensive, computational methods for the genome-wide detection of these sites are urgently needed. With this in mind, the current study was devoted to constructing a computational method to identify 5hmC, 6mA, and 4mC sites. We first formulated samples using three encoding schemes: K-tuple nucleotide component, nucleotide chemical property combined with nucleotide frequency, and mono-nucleotide binary encoding. Subsequently, a random forest was used to identify 5hmC, 6mA, and 4mC sites. Cross-validation results showed that the proposed method achieves excellent generalization ability in identifying the three types of modification site. Based on the proposed model, a web server called iDNA-MS was established and is freely accessible at http://lin-group.cn/server/iDNA-MS.
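
The three sequence encodings named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the default k = 2, and the exact chemical-property triples (ring structure, functional group, hydrogen bonding) are assumptions chosen for illustration, and the published tool may define or combine the features differently.

```python
from itertools import product

ALPHABET = "ACGT"

# Assumed property triples: (purine ring, amino group, weak hydrogen bonding).
# This mapping is an illustrative assumption, not taken from the record above.
NCP = {"A": (1, 1, 1), "C": (0, 1, 0), "G": (1, 0, 0), "T": (0, 0, 1)}


def kmer_composition(seq, k=2):
    """K-tuple nucleotide component: normalized frequency of each k-mer."""
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = len(seq) - k + 1
    for i in range(total):
        sub = seq[i:i + k]
        if sub in counts:  # skip k-mers containing non-ACGT symbols
            counts[sub] += 1
    return [counts[m] / total if total > 0 else 0.0 for m in kmers]


def ncp_with_frequency(seq):
    """Chemical-property triple plus running frequency of the base so far."""
    vec, seen = [], {}
    for i, base in enumerate(seq, start=1):
        seen[base] = seen.get(base, 0) + 1
        vec.extend(NCP.get(base, (0, 0, 0)))
        vec.append(seen[base] / i)  # occurrences of this base in seq[:i], divided by i
    return vec


def binary_encoding(seq):
    """Mono-nucleotide binary (one-hot) encoding: A=1000, C=0100, G=0010, T=0001."""
    vec = []
    for base in seq:
        vec.extend(1 if base == b else 0 for b in ALPHABET)
    return vec


def encode(seq, k=2):
    """Concatenate the three encodings into one feature vector."""
    seq = seq.upper()
    return kmer_composition(seq, k) + ncp_with_frequency(seq) + binary_encoding(seq)
```

For fixed-length sequence windows, `encode` yields vectors of equal length, which could then be passed to any random forest implementation (for example, scikit-learn's `RandomForestClassifier`) to train a site classifier.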

SUBMITTER: Lv H 

PROVIDER: S-EPMC7115099 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8044371 | biostudies-literature
| S-EPMC6746913 | biostudies-literature
| S-EPMC4910163 | biostudies-literature
| S-EPMC4581360 | biostudies-literature
| S-EPMC7509369 | biostudies-literature
| S-EPMC1131891 | biostudies-literature
| S-EPMC11360052 | biostudies-literature
| S-EPMC3677880 | biostudies-literature
| S-EPMC2633013 | biostudies-literature
| S-EPMC4210226 | biostudies-literature