Unknown

Dataset Information

0

Long Read Single-Molecule Real-Time Sequencing Elucidates Transcriptome-Wide Heterogeneity and Complexity in Esophageal Squamous Cells.


ABSTRACT: Esophageal squamous cell carcinoma is a leading cause of cancer death. Mapping the transcriptional landscapes such as isoforms, fusion transcripts, as well as long noncoding RNAs have played a central role to understand the regulating mechanism during malignant processes. However, canonical methods such as short-read RNA-seq are difficult to define the entire polyadenylated RNA molecules. Here, we combined single-molecule real-time sequencing with RNA-seq to generate high-quality long reads and to survey the transcriptional program in esophageal squamous cells. Compared with the recent annotations of human transcriptome (Ensembl 38 release 91), single-molecule real-time data identified many unannotated transcripts, novel isoforms of known genes and an expanding repository of long intergenic noncoding RNAs (lincRNAs). By integrating with annotation of lincRNA catalog, 1,521 esophageal-cancer-specific lincRNAs were defined from single-molecule real-time reads. Kyoto Encyclopedia of Genes and Genomes enrichment analysis indicated that these lincRNAs and their target genes are involved in a variety of cancer signaling pathways. Isoform usage analysis revealed the shifted alternative splicing patterns, which can be recaptured from clinical samples or supported by previous studies. Utilizing vigorous searching criteria, we also detected multiple transcript fusions, which are not documented in current gene fusion database or readily identified from RNA-seq reads. Two novel fusion transcripts were verified based on real-time PCR and Sanger sequencing. Overall, our long-read single-molecule sequencing largely expands current understanding of full-length transcriptome in esophageal cells and provides novel insights on the transcriptional diversity during oncogenic transformation.

SUBMITTER: Cheng YW 

PROVIDER: S-EPMC6787290 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Long Read Single-Molecule Real-Time Sequencing Elucidates Transcriptome-Wide Heterogeneity and Complexity in Esophageal Squamous Cells.

Cheng Yin-Wei YW   Chen Yun-Mei YM   Zhao Qian-Qian QQ   Zhao Xing X   Wu Ya-Ru YR   Chen Dan-Ze DZ   Liao Lian-Di LD   Chen Yang Y   Yang Qian Q   Xu Li-Yan LY   Li En-Min EM   Xu Jian-Zhen JZ  

Frontiers in genetics 20191004


Esophageal squamous cell carcinoma is a leading cause of cancer death. Mapping the transcriptional landscapes such as isoforms, fusion transcripts, as well as long noncoding RNAs have played a central role to understand the regulating mechanism during malignant processes. However, canonical methods such as short-read RNA-seq are difficult to define the entire polyadenylated RNA molecules. Here, we combined single-molecule real-time sequencing with RNA-seq to generate high-quality long reads and  ...[more]

Similar Datasets

| S-EPMC4931018 | biostudies-literature
| S-EPMC6105124 | biostudies-literature
| S-EPMC8683407 | biostudies-literature
| S-EPMC6821003 | biostudies-literature
| S-EPMC8320875 | biostudies-literature
| S-EPMC7534443 | biostudies-literature
| S-EPMC5335617 | biostudies-literature
| S-EPMC7359550 | biostudies-literature
| S-EPMC6441658 | biostudies-literature
| S-EPMC6240054 | biostudies-literature