Genomics

Dataset Information

4

Full genome sequencing of a monozygotic twin discordant for schizophrenia


ABSTRACT: We sequenced the genomes from a monozygotic twin discordant for schizophrenia and a tumor-normal pair of an ovarian cancer patient. Using whole-genome twin data to discriminate between correctly identified single nucleotide variants (SNVs) and errors a strategy for the accurate detection of SNVs was developed. By applying stringent sequencing quality measures, excluding error-prone regions and selecting SNVs identified by different mapping and variation calling algorithms, error rates were ~37-fold reduced. This enabled us to identify the first discordant SNVs in monozygotic twins using whole-genome sequencing. In addition, by showing that novel SNVs are highly enriched in errors, accurate estimates of the number of novel and rare SNVs occurring in unrelated Caucasian individuals were obtained. Finally, somatic mutations in coding and regulatory sequences were reliably identified in the highly rearranged ovarian tumor. Overall, our data demonstrate that strategies to reduce error rates in whole-genomes are required for disease gene discovery.

PROVIDER: EGAS00001000152 | EGA |

REPOSITORIES: EGA

Similar Datasets

| EGAS00001000158 | EGA
2014-01-25 | E-GEOD-54370 | biostudies-arrayexpress
2012-12-10 | GSE38291 | GEO
2012-10-20 | E-GEOD-33478 | biostudies-arrayexpress
2010-12-31 | GSE16461 | GEO
2014-01-25 | GSE54370 | GEO
2011-09-26 | GSE31439 | GEO
2018-08-02 | GSE100488 | GEO
2008-03-30 | E-GEOD-7624 | biostudies-arrayexpress
2007-03-31 | GSE7036 | GEO