Genomics

Dataset Information

0

RNA sequencing and Illumina 2.5M SNP array data collected from 675 commonly used human cancer cell lines.


ABSTRACT: Tumor-derived cell lines have served as vital models to advance our understanding of oncogene function and therapeutic response1. Although substantial effort has been directed to defining the genomic constitution of cancer cell line panels2–4, the transcriptome – which represents the active program of a cell – remains understudied. Here, we describe RNA sequencing and SNP array analysis of 675 commonly used human cancer cell lines. We explore numerous transcriptome features including coding and non-coding gene expression, transcribed mutations, gene fusion and expression of non-human sequences. Aside from many known aberrations we find new surprising characteristics, including more than 2200 unique fusion gene pairs representing a vast, testable repertoire of oncogenic fusions, many of which have analogs found in primary human tumors. We show that a combination of multiple genome and transcriptome features in a novel pathway-based approach enhances prediction of response to various targeted therapeutics. Our results provide valuable new insights into these critical pre-clinical models and provide added context for interpreting the numerous studies that employ these widely used cell lines.

OTHER RELATED OMICS DATASETS IN: E-MTAB-2706

PROVIDER: EGAS00001000610 | EGA |

REPOSITORIES: EGA

altmetric image

Publications


Tumor-derived cell lines have served as vital models to advance our understanding of oncogene function and therapeutic responses. Although substantial effort has been made to define the genomic constitution of cancer cell line panels, the transcriptome remains understudied. Here we describe RNA sequencing and single-nucleotide polymorphism (SNP) array analysis of 675 human cancer cell lines. We report comprehensive analyses of transcriptome features including gene expression, mutations, gene fus  ...[more]

Similar Datasets

2014-11-25 | E-MTAB-2706 | biostudies-arrayexpress
2014-09-18 | E-GEOD-50941 | biostudies-arrayexpress
2018-10-07 | MSV000083006 | MassIVE
| phs000602 | dbGaP
2023-02-16 | PXD035100 | Pride
2024-06-18 | PXD052856 | Pride
| PRJNA330685 | ENA
2015-04-18 | E-GEOD-55727 | biostudies-arrayexpress
2024-12-03 | E-MTAB-14655 | biostudies-arrayexpress
2021-12-05 | ST002010 | MetabolomicsWorkbench