Dataset Information

Summarizing specific profiles in Illumina sequencing from whole-genome amplified DNA.


ABSTRACT: Advances in both high-throughput sequencing and whole-genome amplification (WGA) protocols have allowed genomes to be sequenced from femtograms of DNA, for example from individual cells or from precious clinical and archived samples. Using the highly curated Caenorhabditis elegans genome as a reference, we have sequenced and identified errors and biases associated with Illumina library construction, library insert size, different WGA methods and genome features such as GC bias and simple repeat content. Detailed analysis of the reads from amplified libraries revealed characteristics suggesting that the majority of amplified fragment ends are identical but inverted versions of each other. Read coverage in amplified libraries is correlated with both tandem and inverted repeat content, while GC content only influences sequencing in long-insert libraries. Nevertheless, single nucleotide polymorphism (SNP) calls and assembly metrics from reads in amplified libraries show comparable results with unamplified libraries. To utilize the full potential of WGA to reveal the real biological interest, this article highlights the importance of recognizing additional sources of errors from amplified sequence reads and discusses the potential implications in downstream analyses.

SUBMITTER: Tsai IJ 

PROVIDER: S-EPMC4060946 | biostudies-literature | 2014 Jun

REPOSITORIES: biostudies-literature

Publications

Summarizing specific profiles in Illumina sequencing from whole-genome amplified DNA.

Tsai IJ, Hunt M, Holroyd N, Huckvale T, Berriman M, Kikuchi T

DNA Research: an international journal for rapid publication of reports on genes and genomes, 2013-12-18, issue 3


Similar Datasets

| S-EPMC2956718 | biostudies-literature
| S-EPMC3534403 | biostudies-literature
| S-EPMC5838969 | biostudies-literature
| S-EPMC4835089 | biostudies-literature
| S-EPMC10132692 | biostudies-literature
| S-EPMC3473372 | biostudies-literature
| S-EPMC5905984 | biostudies-literature
| S-EPMC2797715 | biostudies-literature
| S-EPMC8002175 | biostudies-literature
| S-EPMC5971424 | biostudies-literature