Transcriptomics

Dataset Information

Ultra-low error synthetic long-read single-cell sequencing reveals expressions of hypermutation clusters of isoforms in human liver cancer cells


ABSTRACT: The protein diversity of mammalian cells is determined by the arrays of isoforms expressed from genes. Protein mutation is essential in species evolution and cancer development. Accurate long-read transcriptome sequencing at the single-cell level is required to decipher the spectrum of protein expression in mammalian organisms. In this report, we developed a synthetic long-read single-cell sequencing technology based on the LOOPseq technique. We applied this technology to analyze 447 transcriptomes of hepatocellular carcinoma (HCC) and benign liver from one individual. Through Uniform Manifold Approximation and Projection (UMAP) analysis, we identified a panel of mutant mRNA isoforms highly specific to HCC cells. We also identified the evolutionary pathways that led to hypermutation clusters in single human leukocyte antigen (HLA) molecules, and detected novel fusion transcripts. Combining gene expression, fusion gene transcripts, and mutant gene expression significantly improved the classification of liver cancer cells versus benign hepatocytes. In conclusion, LOOPseq single-cell technology may offer a new level of precision in analyzing the mammalian transcriptome.

ORGANISM(S): Homo sapiens

PROVIDER: GSE223743 | GEO | 2024/01/04

REPOSITORIES: GEO

Similar Datasets

| EGAS00001002697 | EGA
2019-05-06 | PXD013057 | Pride
2017-05-01 | GSE76877 | GEO
2022-12-01 | GSE135631 | GEO
2018-01-09 | PXD008270 | Pride
2017-01-01 | GSE76026 | GEO
2017-05-01 | E-GEOD-76877 | biostudies-arrayexpress
2017-01-01 | GSE73628 | GEO
2022-06-28 | PXD032201 | Pride
2020-12-09 | PXD019915 | JPOST Repository