Unknown

Dataset Information

0

Identification of somatic mutations in single cell DNA-seq using a spatial model of allelic imbalance.


ABSTRACT: Recent advances in single cell technology have enabled dissection of cellular heterogeneity in great detail. However, analysis of single cell DNA sequencing data remains challenging due to bias and artifacts that arise during DNA extraction and whole-genome amplification, including allelic imbalance and dropout. Here, we present a framework for statistical estimation of allele-specific amplification imbalance at any given position in single cell whole-genome sequencing data by utilizing the allele frequencies of heterozygous single nucleotide polymorphisms in the neighborhood. The resulting allelic imbalance profile is critical for determining whether the variant allele fraction of an observed mutation is consistent with the expected fraction for a true variant. This method, implemented in SCAN-SNV (Single Cell ANalysis of SNVs), substantially improves the identification of somatic variants in single cells. Our allele balance framework is broadly applicable to genotype analysis of any variant type in any data that might exhibit allelic imbalance.

SUBMITTER: Luquette LJ 

PROVIDER: S-EPMC6715686 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of somatic mutations in single cell DNA-seq using a spatial model of allelic imbalance.

Luquette Lovelace J LJ   Bohrson Craig L CL   Sherman Max A MA   Park Peter J PJ  

Nature communications 20190829 1


Recent advances in single cell technology have enabled dissection of cellular heterogeneity in great detail. However, analysis of single cell DNA sequencing data remains challenging due to bias and artifacts that arise during DNA extraction and whole-genome amplification, including allelic imbalance and dropout. Here, we present a framework for statistical estimation of allele-specific amplification imbalance at any given position in single cell whole-genome sequencing data by utilizing the alle  ...[more]

Similar Datasets

| S-EPMC5431982 | biostudies-literature
| S-EPMC4430307 | biostudies-literature
| S-EPMC11343128 | biostudies-literature
| S-EPMC4148184 | biostudies-literature
| S-EPMC5117254 | biostudies-literature
| S-EPMC4648491 | biostudies-literature
| S-EPMC6454553 | biostudies-literature
| S-EPMC4230747 | biostudies-literature
| S-EPMC8626927 | biostudies-literature
| S-EPMC5547479 | biostudies-other