Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

NimbleGen 42M data for the HuRef individual


ABSTRACT: The ideal genome sequence for medical interpretation is complete and diploid, capturing the full spectrum of genetic variation. Toward this end, there has been progress in discovery of single nucleotide polymorphism (SNP) and small (<10bp) insertion/deletions (indels), but annotation of larger structural variation (SV) including copy number variation (CNV) has been less comprehensive, even with available diploid sequence assemblies. We applied a multi-step sequence and microarray-based analysis to identify numerous previously unknown SVs within the first genome sequence reported from an individual. The HuRef genomic DNA (from lymphoblastoid cell line) was co-hybridized with the female sample NA15510 (from lymphoblastoid cell line) from the Polymorphism Discovery Resources. The NimbleGen platform consists of 20 NimbleGen HD2 chips, each containing 2.1M probes, and each chip is further subdivided into 3 equal-sized subarrays containing about 726K probes. The probes target the NCBI Build 36 genome, with the exception that no homology filter was applied, allowing coverage of segmentally duplicated regions.

ORGANISM(S): Homo sapiens

SUBMITTER: Andy Pang 

PROVIDER: E-GEOD-20289 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications


<h4>Background</h4>Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number  ...[more]

Similar Datasets

2013-02-12 | E-GEOD-20275 | biostudies-arrayexpress
2013-02-12 | E-GEOD-20288 | biostudies-arrayexpress
2013-02-12 | E-GEOD-20287 | biostudies-arrayexpress
2013-02-12 | E-GEOD-20284 | biostudies-arrayexpress
2013-02-12 | GSE20289 | GEO
2013-02-12 | GSE20284 | GEO
2013-02-12 | GSE20275 | GEO
2013-02-12 | GSE20287 | GEO
2013-02-12 | GSE20288 | GEO
| PRJNA183336 | ENA