Unknown

Dataset Information

0

Nanopore basecalling from a perspective of instance segmentation.


ABSTRACT:

Background

Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment's pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-the art nanopore basecallers achieve basecalling accuracy in a range from 85% to 95%.

Result

In this work, we proposed a novel basecalling approach from a perspective of instance segmentation. Different from previous approaches of doing typical sequence labeling, we formulated the basecalling problem as a multi-label segmentation task. Meanwhile, we proposed a refined U-net model which we call UR-net that can model sequential dependencies for a one-dimensional segmentation task. The experiment results show that the proposed basecaller URnano achieves competitive results on the in-species data, compared to the recently proposed CTC-featured basecallers.

Conclusion

Our results show that formulating the basecalling problem as a one-dimensional segmentation task is a promising approach, which does basecalling and segmentation jointly.

SUBMITTER: Zhang YZ 

PROVIDER: S-EPMC7178565 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Nanopore basecalling from a perspective of instance segmentation.

Zhang Yao-Zhong YZ   Akdemir Arda A   Tremmel Georg G   Imoto Seiya S   Miyano Satoru S   Shibuya Tetsuo T   Yamaguchi Rui R  

BMC bioinformatics 20200423 Suppl 3


<h4>Background</h4>Nanopore sequencing is a rapidly developing third-generation sequencing technology, which can generate long nucleotide reads of molecules within a portable device in real-time. Through detecting the change of ion currency signals during a DNA/RNA fragment's pass through a nanopore, genotypes are determined. Currently, the accuracy of nanopore basecalling has a higher error rate than the basecalling of short-read sequencing. Through utilizing deep neural networks, the-state-of-  ...[more]

Similar Datasets

| S-EPMC8260538 | biostudies-literature
| S-EPMC8321355 | biostudies-literature
| S-EPMC8794127 | biostudies-literature
| S-EPMC7160130 | biostudies-literature
| S-EPMC7394709 | biostudies-literature
| S-EPMC8114180 | biostudies-literature
| S-EPMC8774909 | biostudies-literature
| S-EPMC8455788 | biostudies-literature
| S-EPMC6436813 | biostudies-literature