Unknown

Dataset Information

0

Gaussian mixture model-based unsupervised nucleotide modification number detection using nanopore-sequencing readouts.


ABSTRACT:

Motivation

Nucleotide modification status can be decoded from the Oxford Nanopore Technologies nanopore-sequencing ionic current signals. Although various algorithms have been developed for nanopore-sequencing-based modification analysis, more detailed characterizations, such as modification numbers, corresponding signal levels and proportions are still lacking.

Results

We present a framework for the unsupervised determination of the number of nucleotide modifications from nanopore-sequencing readouts. We demonstrate the approach can effectively recapitulate the number of modifications, the corresponding ionic current signal levels, as well as mixing proportions under both DNA and RNA contexts. We further show, by integrating information from multiple detected modification regions, that the modification status of DNA and RNA molecules can be inferred. This method forms a key step of de novo characterization of nucleotide modifications, shedding light on the interpretation of various biological questions.

Availability and implementation

Modified nanopolish: https://github.com/adbailey4/nanopolish/tree/cigar_output. All other codes used to reproduce the results: https://github.com/hd2326/ModificationNumber.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Ding H 

PROVIDER: S-EPMC7723331 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2717951 | biostudies-literature
| S-EPMC8595663 | biostudies-literature
2021-09-20 | GSE173688 | GEO
| S-EPMC11339354 | biostudies-literature
| S-EPMC8677041 | biostudies-literature
| S-EPMC10403694 | biostudies-literature
2023-02-04 | GSE224018 | GEO
| S-EPMC4855479 | biostudies-literature
| S-EPMC10803047 | biostudies-literature
| S-EPMC10552309 | biostudies-literature