Unknown

Dataset Information

0

ISOWN: accurate somatic mutation identification in the absence of normal tissue controls.


ABSTRACT: A key step in cancer genome analysis is the identification of somatic mutations in the tumor. This is typically done by comparing the genome of the tumor to the reference genome sequence derived from a normal tissue taken from the same donor. However, there are a variety of common scenarios in which matched normal tissue is not available for comparison.In this work, we describe an algorithm to distinguish somatic single nucleotide variants (SNVs) in next-generation sequencing data from germline polymorphisms in the absence of normal samples using a machine learning approach. Our algorithm was evaluated using a family of supervised learning classifications across six different cancer types and ~1600 samples, including cell lines, fresh frozen tissues, and formalin-fixed paraffin-embedded tissues; we tested our algorithm with both deep targeted and whole-exome sequencing data. Our algorithm correctly classified between 95 and 98% of somatic mutations with F1-measure ranges from 75.9 to 98.6% depending on the tumor type. We have released the algorithm as a software package called ISOWN (Identification of SOmatic mutations Without matching Normal tissues).In this work, we describe the development, implementation, and validation of ISOWN, an accurate algorithm for predicting somatic mutations in cancer tissues in the absence of matching normal tissues. ISOWN is available as Open Source under Apache License 2.0 from https://github.com/ikalatskaya/ISOWN .

SUBMITTER: Kalatskaya I 

PROVIDER: S-EPMC5490163 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

ISOWN: accurate somatic mutation identification in the absence of normal tissue controls.

Kalatskaya Irina I   Trinh Quang M QM   Spears Melanie M   McPherson John D JD   Bartlett John M S JMS   Stein Lincoln L  

Genome medicine 20170629 1


<h4>Background</h4>A key step in cancer genome analysis is the identification of somatic mutations in the tumor. This is typically done by comparing the genome of the tumor to the reference genome sequence derived from a normal tissue taken from the same donor. However, there are a variety of common scenarios in which matched normal tissue is not available for comparison.<h4>Results</h4>In this work, we describe an algorithm to distinguish somatic single nucleotide variants (SNVs) in next-genera  ...[more]

Similar Datasets

| S-EPMC6399298 | biostudies-literature
| S-EPMC5584341 | biostudies-literature
| S-EPMC5988673 | biostudies-literature
| S-EPMC4856976 | biostudies-literature
| S-EPMC8419911 | biostudies-literature
| EGAS00001002857 | EGA
| EGAS00001002858 | EGA
| S-EPMC3259434 | biostudies-literature
| S-EPMC8492474 | biostudies-literature
| S-EPMC3907544 | biostudies-literature