
Dataset Information


Experiences with array-based sequence capture; toward clinical applications.


ABSTRACT: Although sequencing of a human genome gradually becomes an option, zooming in on the region of interest remains attractive and cost saving. We performed array-based sequence capture using 385K Roche NimbleGen, Inc. arrays to zoom in on the protein-coding and immediate intron-flanking sequences of 112 genes, potentially involved in mental retardation and congenital malformation. Captured material was sequenced using Illumina technology. A data analysis pipeline was built that detects sequence variants, positions them in relation to the gene, checks for presence in databases (eg, dbSNP, the single-nucleotide polymorphism (SNP) database) and predicts the potential consequences at the level of RNA splicing and protein translation. In the samples analyzed, all known variants were reliably detected, including pathogenic variants from control cases and SNPs derived from array experiments. Although overall coverage varied considerably, it was reproducible per region and facilitated the detection of large deletions and duplications (copy number variations), including a partial deletion in the B3GALTL gene from a patient sample. For ultimate diagnostic application, overall results need to be improved. Future arrays should contain probes from both DNA strands, and to obtain a more even coverage, one could add fewer probes from densely and more probes from sparsely covered regions.
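
The abstract notes that coverage, although uneven, was reproducible per region and so could reveal large deletions and duplications. The record does not describe how the authors computed this, so the following is only a minimal sketch of one common depth-ratio approach, not the published pipeline. The function names, the region labels, the read depths, and the 0.7 and 1.3 thresholds are all illustrative assumptions.

```python
from statistics import median


def normalize(depths):
    """Divide each region's read depth by the sample median so that
    samples sequenced to different total depths become comparable."""
    m = median(depths.values())
    if m == 0:
        raise ValueError("median depth is zero; cannot normalize")
    return {region: depth / m for region, depth in depths.items()}


def call_cnv(sample, reference, loss=0.7, gain=1.3):
    """Flag regions whose normalized depth differs from the reference.

    The loss/gain thresholds are assumed values for illustration only.
    """
    s, r = normalize(sample), normalize(reference)
    calls = {}
    for region, value in s.items():
        if region not in r or r[region] == 0:
            continue  # no usable reference depth for this region
        ratio = value / r[region]
        if ratio <= loss:
            calls[region] = ("deletion", round(ratio, 2))
        elif ratio >= gain:
            calls[region] = ("duplication", round(ratio, 2))
    return calls


# Hypothetical per-region mean depths (not data from the study).
reference = {"exon1": 100, "exon2": 200, "exon3": 50, "exon4": 120, "exon5": 80}
sample = {"exon1": 110, "exon2": 210, "exon3": 26, "exon4": 125, "exon5": 90}

calls = call_cnv(sample, reference)
print(calls)
```

Because uneven coverage affects the sample and the reference in the same way when it is reproducible per region, the ratio cancels most of it out. In this made-up input only exon3 falls below the assumed loss threshold.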

SUBMITTER: Almomani R 

PROVIDER: S-EPMC3039511 | biostudies-literature | 2011 Jan

REPOSITORIES: biostudies-literature


Publications

Experiences with array-based sequence capture; toward clinical applications.

Almomani R, van der Heijden J, Ariyurek Y, Lai Y, Bakker E, van Galen M, Breuning MH, den Dunnen JT

European journal of human genetics : EJHG, 2010-11-24, 1



Similar Datasets

| S-EPMC3140021 | biostudies-literature
| S-EPMC8397432 | biostudies-literature
| S-EPMC7598885 | biostudies-literature
| S-EPMC7952987 | biostudies-literature
| S-EPMC4275883 | biostudies-other
| S-EPMC8002224 | biostudies-literature
2020-10-23 | GSE159431 | GEO
| S-EPMC7276250 | biostudies-literature
2006-07-28 | GSE4775 | GEO
| S-EPMC4788223 | biostudies-literature