Dataset Information

Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments.


ABSTRACT: Single-cell RNA sequencing allows the gene expression landscape to be characterized at the cell-type level. However, because it relies on short reads, it is severely limited in detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by using both long reads and short reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. The challenge in using these techniques is that the cellular barcodes sequenced in the error-prone long reads must be matched to the cellular barcodes detected in the short reads. To overcome this challenge, we introduce scTagger, a computational method for matching cellular barcode data between long reads and short reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and significantly better time efficiency.
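
The matching problem described in the abstract can be illustrated with a minimal sketch. This is not scTagger's algorithm, which the abstract does not describe; it is a naive baseline that assigns a long-read barcode to the closest short-read barcode by edit distance. The function names, the `max_dist` threshold of 2, and the rule of discarding ambiguous matches are all assumptions made for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def match_barcode(observed, whitelist, max_dist=2):
    """Return the unique closest whitelist barcode within max_dist, else None.

    `whitelist` stands in for the barcodes detected in the short reads
    (hypothetical input); `max_dist` is an assumed error tolerance.
    """
    best, best_dist, tied = None, max_dist + 1, False
    for bc in whitelist:
        d = edit_distance(observed, bc)
        if d < best_dist:
            best, best_dist, tied = bc, d, False
        elif d == best_dist and d <= max_dist:
            tied = True  # two candidates equally close: ambiguous
    return None if tied or best is None else best


if __name__ == "__main__":
    short_read_barcodes = ["ACGTACGTACGTACGT", "TTTTCCCCGGGGAAAA"]
    # A long-read barcode with one deleted base still resolves to the first entry.
    print(match_barcode("ACGTACGTACGACGT", short_read_barcodes))
```

Comparing every long-read barcode against every short-read barcode in this way scales with the product of the two set sizes, which is why a dedicated method is needed for time efficiency on real datasets.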

SUBMITTER: Ebrahimi G 

PROVIDER: S-EPMC9209721 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

Publications

Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments.

Ghazal Ebrahimi, Baraa Orabi, Meghan Robinson, Cedric Chauve, Ryan Flannigan, Faraz Hach

iScience, 2022 Jun 7, issue 7


Similar Datasets

| S-EPMC6659269 | biostudies-literature
| S-EPMC7276436 | biostudies-literature
| S-EPMC3424124 | biostudies-literature
| S-EPMC4605292 | biostudies-literature
| S-EPMC2952873 | biostudies-literature
| S-EPMC4615873 | biostudies-literature
| S-EPMC4889935 | biostudies-literature
| S-EPMC3614465 | biostudies-other
| S-EPMC4411664 | biostudies-literature
| S-EPMC5834899 | biostudies-literature