Dataset Information

Detection of DNA base modifications by deep recurrent neural network on Oxford Nanopore sequencing data.


ABSTRACT: DNA base modifications, such as C5-methylcytosine (5mC) and N6-methyldeoxyadenosine (6mA), are important forms of epigenetic regulation. Short-read bisulfite sequencing and long-read PacBio sequencing have inherent limitations in detecting DNA modifications. Here, using raw electrical signals from Oxford Nanopore long-read sequencing data, we design DeepMod, a bidirectional recurrent neural network (RNN) with long short-term memory (LSTM), to detect DNA modifications. We sequence a human genome (HX1) and a Chlamydomonas reinhardtii genome using Nanopore sequencing, and then evaluate DeepMod on three types of genomes (Escherichia coli, Chlamydomonas reinhardtii and human). For 5mC detection, DeepMod achieves average precision up to 0.99 for both synthetically introduced and naturally occurring modifications. For 6mA detection, DeepMod achieves ~0.9 average precision on Escherichia coli data and outperforms existing methods on Chlamydomonas reinhardtii data. In conclusion, DeepMod performs well for genome-scale detection of DNA modifications and will facilitate epigenetic analysis of diverse species.

SUBMITTER: Liu Q 

PROVIDER: S-EPMC6547721 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

Publications

Detection of DNA base modifications by deep recurrent neural network on Oxford Nanopore sequencing data.

Liu Qian, Fang Li, Yu Guoliang, Wang Depeng, Xiao Chuan-Le, Wang Kai

Nature Communications, 2019 Jun 4, issue 1


Similar Datasets

| S-EPMC6591954 | biostudies-literature
| S-EPMC5459436 | biostudies-literature
| S-EPMC6245502 | biostudies-literature
2023-02-04 | GSE224018 | GEO
| S-EPMC5513335 | biostudies-literature
| S-EPMC5093776 | biostudies-literature
| S-EPMC5124260 | biostudies-literature
| S-EPMC7470954 | biostudies-literature
| S-EPMC6233134 | biostudies-other
| S-EPMC8402922 | biostudies-literature