Genomics

Dataset Information

4

Complete Genomics paired end sequencing; Ovarian cancer


ABSTRACT: We sequenced the genomes from a monozygotic twin discordant for schizophrenia and a tumor-normal pair of an ovarian cancer patient. Using whole-genome twin data to discriminate between correctly identified single nucleotide variants (SNVs) and errors a strategy for the accurate detection of SNVs was developed. By applying stringent sequencing quality measures, excluding error-prone regions and selecting SNVs identified by different mapping and variation calling algorithms, error rates were ~37-fold reduced. This enabled us to identify the first discordant SNVs in monozygotic twins using whole-genome sequencing. In addition, by showing that novel SNVs are highly enriched in errors, accurate estimates of the number of novel and rare SNVs occurring in unrelated Caucasian individuals were obtained. Finally, somatic mutations in coding and regulatory sequences were reliably identified in the highly rearranged ovarian tumor. Overall, our data demonstrate that strategies to reduce error rates in whole-genomes are required for disease gene discovery.

PROVIDER: EGAS00001000158 | EGA |

REPOSITORIES: EGA

altmetric image

Publications


DNA replication errors that persist as mismatch mutations make up the molecular fingerprint of mismatch repair (MMR)-deficient tumors and convey them with resistance to standard therapy. Using whole-genome and whole-exome sequencing, we here confirm an MMR-deficient mutation signature that is distinct from other tumor genomes, but surprisingly similar to germ-line DNA, indicating that a substantial fraction of human genetic variation arises through mutations escaping MMR. Moreover, we identify a  ...[more]

Similar Datasets

| EGAS00001000152 | EGA
2014-01-25 | E-GEOD-54370 | biostudies-arrayexpress
2012-12-10 | GSE38291 | GEO
2012-10-20 | E-GEOD-33478 | biostudies-arrayexpress
2010-12-31 | GSE16461 | GEO
2014-01-25 | GSE54370 | GEO
2011-09-26 | GSE31439 | GEO
2018-08-02 | GSE100488 | GEO
2008-03-30 | E-GEOD-7624 | biostudies-arrayexpress
2007-03-31 | GSE7036 | GEO