Unknown

Dataset Information

0

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.


ABSTRACT: Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment in a two-pass approach. This method, available in the software package 2passtools ( https://github.com/bartongroup/2passtools ), improves the accuracy of spliced alignment and transcriptome assembly for species both with and without existing high-quality annotations.

SUBMITTER: Parker MT 

PROVIDER: S-EPMC7919322 | biostudies-literature | 2021 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.

Parker Matthew T MT   Knop Katarzyna K   Barton Geoffrey J GJ   Simpson Gordon G GG  

Genome biology 20210301 1


Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment  ...[more]

Similar Datasets

| S-EPMC5006238 | biostudies-other
| S-EPMC3464235 | biostudies-literature
| S-EPMC5103424 | biostudies-literature
| S-EPMC6192213 | biostudies-literature
| S-EPMC6307148 | biostudies-literature
2013-07-15 | E-MTAB-1728 | biostudies-arrayexpress
2021-06-30 | GSE165527 | GEO
| S-EPMC5203714 | biostudies-literature
| S-EPMC2672628 | biostudies-literature
| S-EPMC4446445 | biostudies-literature