Dataset Information

Particle swarm optimization with reinforcement learning for the prediction of CpG islands in the human genome.


ABSTRACT:

Background

Genomic regions longer than 200 bp that are rich in GC nucleotides and contain a high number of CpG dinucleotides are often referred to as CpG islands. These islands are usually located at the 5' end of genes. Several algorithms for predicting CpG islands have recently been proposed.
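
As a rough illustration of the definition above, the following Python sketch checks whether a sequence meets a CpG island criterion. The thresholds for GC content (0.5) and for the observed/expected CpG ratio (0.6) are assumptions borrowed from the commonly cited Gardiner-Garden and Frommer criteria; the abstract itself states only the 200 bp length bound. The function names are hypothetical and are not part of CPSORL.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C nucleotides in the sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CpG * length) / (#C * #G)."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)


def looks_like_cpg_island(seq: str,
                          min_len: int = 200,    # length bound from the abstract
                          min_gc: float = 0.5,   # assumed threshold
                          min_oe: float = 0.6) -> bool:  # assumed threshold
    """Return True if the sequence satisfies all three criteria."""
    return (len(seq) > min_len
            and gc_content(seq) >= min_gc
            and cpg_obs_exp(seq) >= min_oe)
```

For example, `looks_like_cpg_island("CG" * 150)` passes all three checks, while an AT-only sequence of the same length does not.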

Methodology/principal findings

We propose a new method, CPSORL, which combines a complement particle swarm optimization algorithm with reinforcement learning to predict CpG islands more reliably. Several CpG island prediction tools based on the sliding window technique have been developed previously. However, the quality of their results depends heavily on the chosen window size, which leaves room for improvement.

Conclusions/significance

Experimental results indicate that CPSORL achieves a higher sensitivity and a higher correlation coefficient on all selected experimental contigs than the methods it was compared with (CpGIS, CpGcluster, CpGProd and CpGPlot). It identified more CpG islands in chromosomes 21 and 22 of the human genome than the other methods from the literature, and it achieved the highest coverage rate (3.4%). CPSORL can be used to identify promoter and TSS regions associated with CpG islands across the entire human genome. Compared with CpGcluster, the islands predicted by CPSORL covered a larger portion of the TSS (12.2%) and promoter (26.1%) regions. When Alu sequences are considered, the islands predicted by CPSORL (Alu) covered a larger portion of the TSS (40.5%) and promoter (67.8%) regions than those predicted by CpGIS. Furthermore, CPSORL was used to verify that the average methylation density was 5.33% for CpG islands in the entire human genome.
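
The abstract does not define the sensitivity and correlation coefficient it reports. The sketch below assumes the standard definitions computed from true/false positive and negative counts (the correlation coefficient here is the Matthews form), which may differ in detail from what the authors used. The function names and the example counts are hypothetical.

```python
import math


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true island positions that were predicted: TP / (TP + FN)."""
    total = tp + fn
    return tp / total if total else 0.0


def correlation_coefficient(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews-style correlation between predicted and true labels."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Return 0.0 when any marginal is empty and the coefficient is undefined.
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Illustrative counts only, not taken from the paper.
sn = sensitivity(tp=50, fn=5)
cc = correlation_coefficient(tp=50, tn=40, fp=5, fn=5)
```

A perfect prediction gives a sensitivity of 1 and a correlation coefficient of 1; a coefficient near 0 means the prediction is no better than chance.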

SUBMITTER: Chuang LY 

PROVIDER: S-EPMC3125183 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

Publications

Particle swarm optimization with reinforcement learning for the prediction of CpG islands in the human genome.

Li-Yeh Chuang, Hsiu-Chen Huang, Ming-Cheng Lin, Cheng-Hong Yang

PLoS One. 2011 Jun 28; 6


Similar Datasets

| S-EPMC4849747 | biostudies-literature
| S-EPMC2896535 | biostudies-literature
| S-EPMC4433345 | biostudies-literature
| S-EPMC3919054 | biostudies-other
| S-EPMC4365407 | biostudies-other
| S-EPMC4509494 | biostudies-literature
| S-EPMC5439980 | biostudies-literature
| S-EPMC5716574 | biostudies-literature