Unknown

Dataset Information

0

Robust in-silico identification of Cancer Cell Lines based on RNA and targeted DNA sequencing data.


ABSTRACT: Cancer cell lines (CCL) are an integral part of modern cancer research but are susceptible to misidentification. The increasing popularity of sequencing technologies motivates the in-silico identification of CCLs based on their mutational fingerprint, but care must be taken when identifying heterogeneous data. We recently developed the proof-of-concept Uniquorn 1 method which could reliably identify heterogeneous sequencing data from selected sequencing technologies. Here we present Uniquorn 2, a generic and robust in-silico identification method for CCLs with DNA/RNA-seq and panel-seq information. We benchmarked Uniquorn 2 by cross-identifying 1612?RNA and 3596 panel-sized NGS profiles derived from 1516 CCLs, five repositories, four technologies and three major cancer panel-designs. Our method achieves an accuracy of 96% for RNA-seq and 95% for mixed DNA-seq and RNA-seq identification. Even for a panel of only 94 cancer-related genes, accuracy remains at 82% but decreases when using smaller panels. Uniquorn 2 is freely available as R-Bioconductor-package 'Uniquorn'.

SUBMITTER: Otto R 

PROVIDER: S-EPMC6344579 | biostudies-literature | 2019 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robust in-silico identification of Cancer Cell Lines based on RNA and targeted DNA sequencing data.

Otto Raik R   Rössler Jan-Niklas JN   Sers Christine C   Mamlouk Soulafa S   Leser Ulf U  

Scientific reports 20190123 1


Cancer cell lines (CCL) are an integral part of modern cancer research but are susceptible to misidentification. The increasing popularity of sequencing technologies motivates the in-silico identification of CCLs based on their mutational fingerprint, but care must be taken when identifying heterogeneous data. We recently developed the proof-of-concept Uniquorn 1 method which could reliably identify heterogeneous sequencing data from selected sequencing technologies. Here we present Uniquorn 2,  ...[more]

Similar Datasets

| S-EPMC5470969 | biostudies-literature
| S-EPMC7233376 | biostudies-literature
| S-EPMC1994535 | biostudies-other
| S-EPMC7772531 | biostudies-literature
| S-EPMC5356875 | biostudies-literature
| S-EPMC8369412 | biostudies-literature
| S-EPMC3389765 | biostudies-literature
| S-EPMC4227762 | biostudies-literature
| S-EPMC6016759 | biostudies-literature
| S-EPMC7279618 | biostudies-literature