Unknown

Dataset Information

0

Discrepancies in cancer genomic sequencing highlight opportunities for driver mutation discovery.


ABSTRACT: Cancer genome sequencing is being used at an increasing rate to identify actionable driver mutations that can inform therapeutic intervention strategies. A comparison of two of the most prominent cancer genome sequencing databases from different institutes (Cancer Cell Line Encyclopedia and Catalogue of Somatic Mutations in Cancer) revealed marked discrepancies in the detection of missense mutations in identical cell lines (57.38% conformity). The main reason for this discrepancy is inadequate sequencing of GC-rich areas of the exome. We have therefore mapped over 400 regions of consistent inadequate sequencing (cold-spots) in known cancer-causing genes and kinases, in 368 of which neither institute finds mutations. We demonstrate, using a newly identified PAK4 mutation as proof of principle, that specific targeting and sequencing of these GC-rich cold-spot regions can lead to the identification of novel driver mutations in known tumor suppressors and oncogenes. We highlight that cross-referencing between genomic databases is required to comprehensively assess genomic alterations in commonly used cell lines and that there are still significant opportunities to identify novel drivers of tumorigenesis in poorly sequenced areas of the exome. Finally, we assess other reasons for the observed discrepancy, such as variations in dbSNP filtering and the acquisition/loss of mutations, to give explanations as to why there is a discrepancy in pharmacogenomic studies, given recent concerns with poor reproducibility of data.

SUBMITTER: Hudson AM 

PROVIDER: S-EPMC4247168 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Discrepancies in cancer genomic sequencing highlight opportunities for driver mutation discovery.

Hudson Andrew M AM   Yates Tim T   Li Yaoyong Y   Trotter Eleanor W EW   Fawdar Shameem S   Chapman Phil P   Lorigan Paul P   Biankin Andrew A   Miller Crispin J CJ   Brognard John J  

Cancer research 20140925 22


Cancer genome sequencing is being used at an increasing rate to identify actionable driver mutations that can inform therapeutic intervention strategies. A comparison of two of the most prominent cancer genome sequencing databases from different institutes (Cancer Cell Line Encyclopedia and Catalogue of Somatic Mutations in Cancer) revealed marked discrepancies in the detection of missense mutations in identical cell lines (57.38% conformity). The main reason for this discrepancy is inadequate s  ...[more]

Similar Datasets

| S-EPMC4442685 | biostudies-literature
| PRJNA326795 | ENA
| S-EPMC1871563 | biostudies-literature
| S-EPMC4860638 | biostudies-literature
| S-EPMC5503303 | biostudies-literature
| S-EPMC3018819 | biostudies-other
| S-EPMC4133618 | biostudies-literature
| S-EPMC8039954 | biostudies-literature
| S-EPMC3483554 | biostudies-literature
| S-EPMC5449640 | biostudies-literature