In silico analysis identifies novel restriction enzyme combinations that expand reduced representation bisulfite sequencing CpG coverage.
ABSTRACT: Epigenetics is the study of gene expression changes that are not caused by changes in the deoxyribonucleic acid (DNA) sequence. DNA methylation is an epigenetic mark occurring at C-phosphate-G sites (CpGs) that leads to local or regional gene expression changes. Reduced-representation bisulfite sequencing (RRBS) is a technique used to ascertain the DNA methylation of millions of CpGs at single-nucleotide resolution. The genomic coverage of RRBS is determined by the restriction enzyme combination used during library preparation and by the throughput capacity of the next-generation sequencer used to read the generated libraries. The four-nucleotide cutters MspI and TaqαI are restriction enzymes commonly used in RRBS that, when combined, achieve ~12% genomic coverage. The increase in throughput of next-generation sequencers allows for novel combinations of restriction enzymes that provide higher CpG coverage. We performed a near-neighbor analysis of the four-nucleotide sequences most frequently found within 50 nt of all genomic CpGs. This resulted in the identification of seven methylation-insensitive restriction enzymes (AluI, BfaI, HaeIII, HpyCH4V, MluCI, MseI, and MspI) that share similar restriction conditions suitable for RRBS library preparation. We report that the use of two- or three-enzyme combinations increases the theoretical epigenome coverage to almost half of the human genome. We provide the enzyme combinations that are more likely to increase the CpG coverage in human, rat, and mouse genomes.
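The two in silico steps described in the abstract, the near-neighbor four-nucleotide count and the theoretical CpG coverage of an enzyme combination, can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The function names, the definition of "within 50 nt" (a four-mer whose start lies within the window of a CpG's cytosine), the 40-220 bp size-selection range, and the rule that a CpG counts as covered when it lies anywhere inside a size-selected fragment are all assumptions made for illustration. The recognition sites and cut offsets are the standard ones for the seven enzymes named above.

```python
from collections import Counter

# Recognition site and top-strand cut offset for each enzyme (all palindromic).
ENZYMES = {
    "AluI": ("AGCT", 2),     # AG^CT
    "BfaI": ("CTAG", 1),     # C^TAG
    "HaeIII": ("GGCC", 2),   # GG^CC
    "HpyCH4V": ("TGCA", 2),  # TG^CA
    "MluCI": ("AATT", 0),    # ^AATT
    "MseI": ("TTAA", 1),     # T^TAA
    "MspI": ("CCGG", 1),     # C^CGG
}


def kmers_near_cpgs(seq, k=4, window=50):
    """Count k-mers whose start lies within `window` nt of any CpG (assumed definition)."""
    seq = seq.upper()
    near = [False] * len(seq)
    for p in range(len(seq) - 1):
        if seq[p:p + 2] == "CG":
            for s in range(max(0, p - window), min(len(seq), p + window + 1)):
                near[s] = True
    counts = Counter()
    for s in range(len(seq) - k + 1):
        kmer = seq[s:s + k]
        if near[s] and set(kmer) <= set("ACGT"):  # skip k-mers containing N
            counts[kmer] += 1
    return counts


def cut_positions(seq, enzymes):
    """Return sorted cut coordinates for the given enzyme names."""
    cuts = set()
    for name in enzymes:
        site, offset = ENZYMES[name]
        start = seq.find(site)
        while start != -1:
            cuts.add(start + offset)
            start = seq.find(site, start + 1)  # allow overlapping sites
    return sorted(cuts)


def cpg_coverage(seq, enzymes, min_len=40, max_len=220):
    """Fraction of CpGs inside size-selected fragments flanked by two cuts.

    The size range and the whole-fragment coverage rule are assumptions.
    """
    seq = seq.upper()
    total = seq.count("CG")
    if total == 0:
        return 0.0
    cuts = cut_positions(seq, enzymes)
    covered = 0
    for a, b in zip(cuts, cuts[1:]):
        if min_len <= b - a <= max_len:
            covered += seq[a:b].count("CG")
    return covered / total
```

Calling `cpg_coverage(chromosome_sequence, ["MspI", "MseI"])` on each chromosome and pooling the counts would give a rough coverage estimate for a two-enzyme combination. A realistic estimate would also have to account for read length, since RRBS sequences only the fragment ends.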
SUBMITTER: Martinez-Arguelles DB
PROVIDER: S-EPMC4141122 | biostudies-literature | 2014 Aug
REPOSITORIES: biostudies-literature