Other

Dataset Information

0

Precise Transcript Reconstruction with End-Guided Assembly


ABSTRACT: Accurate annotation of transcript isoforms is crucial to understand gene functions, but automated methods for reconstructing full-length transcripts from RNA sequencing (RNA-seq) data remain imprecise. We developed Bookend, a software package for transcript assembly that incorporates data from different RNA-seq techniques, with a focus on identifying and utilizing RNA 5′ and 3′ ends. Through end-guided assembly with Bookend we demonstrate that correct modeling of transcript start and end sites is essential for precise transcript assembly. Furthermore, we discovered that utilization of end-labeled reads present in full-length single-cell RNA-seq (scRNA-seq) datasets dramatically improves the precision of transcript assembly in single cells. Finally, we show that hybrid assembly across short-read, long-read, and end-capture RNA-seq datasets from Arabidopsis, as well as meta-assembly of RNA-seq from single mouse embryonic stem cells (mESCs) can produce end-to-end transcript annotations of comparable quality to reference annotations in these model organisms.

ORGANISM(S): Arabidopsis thaliana

PROVIDER: GSE189482 | GEO | 2022/01/08

REPOSITORIES: GEO

Similar Datasets

2021-02-02 | PXD023373 | Pride
2017-01-20 | GSE93848 | GEO
2023-06-06 | GSE213984 | GEO
2023-04-24 | GSE229621 | GEO
2023-10-14 | GSE215357 | GEO
2023-10-14 | GSE215355 | GEO
2020-03-18 | GSE147118 | GEO
2014-09-25 | E-GEOD-57862 | biostudies-arrayexpress
2019-06-15 | GSE132766 | GEO
2024-05-28 | GSE225361 | GEO