Unknown

Dataset Information

0

Improving the diversity of captured full-length isoforms using a normalized single-molecule RNA-sequencing method.


ABSTRACT: Human genes form a large variety of isoforms after transcription, encoding distinct transcripts to exert different functions. Single-molecule RNA sequencing facilitates accurate identification of the isoforms by extending nucleotide read length significantly. However, the gene or isoform diversity is lowly represented by the mRNA molecules captured by single-molecule RNA sequencing. Here, we show that a cDNA normalization procedure before the library preparation for PacBio RS II sequencing captures 3.2-6.0 fold more full-length high-quality isoform species for different human samples, as compared to the non-normalized capture procedure. Many lowly expressed, functionally important isoforms can be detected. In addition, normalized PacBio RNA sequencing also resolves more allele-specific haplotype transcripts. Finally, we apply the cDNA normalization based long-read RNA sequencing method to profile the transcriptome of human gastric signet-ring cell carcinomas, identify new cancer-specific transcriptome signatures, and thus, bring out the utility of the improved protocols in gene expression studies.

SUBMITTER: Hu Y 

PROVIDER: S-EPMC7393167 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving the diversity of captured full-length isoforms using a normalized single-molecule RNA-sequencing method.

Hu Yueming Y   Shu Xing-Sheng XS   Yu Jiaxian J   Sun Ming-An MA   Sun Ming-An MA   Chen Zewei Z   Liu Xianming X   Fang Qiongfang Q   Zhang Wei W   Hui Xinjie X   Ying Ying Y   Fu Li L   Lu Desheng D   Kumar Rakesh R   Wang Yejun Y  

Communications biology 20200730 1


Human genes form a large variety of isoforms after transcription, encoding distinct transcripts to exert different functions. Single-molecule RNA sequencing facilitates accurate identification of the isoforms by extending nucleotide read length significantly. However, the gene or isoform diversity is lowly represented by the mRNA molecules captured by single-molecule RNA sequencing. Here, we show that a cDNA normalization procedure before the library preparation for PacBio RS II sequencing captu  ...[more]

Similar Datasets

2021-06-16 | E-MTAB-7334 | biostudies-arrayexpress
| S-EPMC8270901 | biostudies-literature
| S-EPMC9537161 | biostudies-literature
| S-EPMC6214230 | biostudies-literature
| S-EPMC7584602 | biostudies-literature
| S-EPMC6941099 | biostudies-literature
| S-EPMC6523297 | biostudies-literature
2020-04-30 | GSE149551 | GEO
| S-EPMC6166824 | biostudies-literature
| S-EPMC7214433 | biostudies-literature