Unknown

Dataset Information

0

Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.


ABSTRACT: Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.

SUBMITTER: Teng H 

PROVIDER: S-EPMC5946831 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.

Teng Haotian H   Cao Minh Duc MD   Hall Michael B MB   Duarte Tania T   Wang Sheng S   Coin Lachlan J M LJM  

GigaScience 20180501 5


Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,  ...[more]

Similar Datasets

| S-EPMC6511066 | biostudies-literature
2025-01-16 | PXD055252 | Pride
| S-EPMC7682815 | biostudies-literature
| S-EPMC8567335 | biostudies-literature
| S-EPMC5714171 | biostudies-literature
| S-EPMC10825146 | biostudies-literature
| PRJNA843932 | ENA
| S-EPMC7759923 | biostudies-literature
| S-EPMC8183963 | biostudies-literature
| S-EPMC8445976 | biostudies-literature