Unknown

Dataset Information

0

Computational prediction of methylation status in human genomic sequences.


ABSTRACT: Epigenetic effects in mammals depend largely on heritable genomic methylation patterns. We describe a computational pattern recognition method that is used to predict the methylation landscape of human brain DNA. This method can be applied both to CpG islands and to non-CpG island regions. It computes the methylation propensity for an 800-bp region centered on a CpG dinucleotide based on specific sequence features within the region. We tested several classifiers for classification performance, including K means clustering, linear discriminant analysis, logistic regression, and support vector machine. The best performing classifier used the support vector machine approach. Our program (called hdfinder) presently has a prediction accuracy of 86%, as validated with CpG regions for which methylation status has been experimentally determined. Using hdfinder, we have depicted the entire genomic methylation patterns for all 22 human autosomes.

SUBMITTER: Das R 

PROVIDER: S-EPMC1502297 | biostudies-literature | 2006 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Computational prediction of methylation status in human genomic sequences.

Das Rajdeep R   Dimitrova Nevenka N   Xuan Zhenyu Z   Rollins Robert A RA   Haghighi Fatemah F   Edwards John R JR   Ju Jingyue J   Bestor Timothy H TH   Zhang Michael Q MQ  

Proceedings of the National Academy of Sciences of the United States of America 20060703 28


Epigenetic effects in mammals depend largely on heritable genomic methylation patterns. We describe a computational pattern recognition method that is used to predict the methylation landscape of human brain DNA. This method can be applied both to CpG islands and to non-CpG island regions. It computes the methylation propensity for an 800-bp region centered on a CpG dinucleotide based on specific sequence features within the region. We tested several classifiers for classification performance, i  ...[more]

Similar Datasets

| S-EPMC4352876 | biostudies-literature
| S-EPMC7490824 | biostudies-literature
| S-EPMC2375192 | biostudies-other
| S-EPMC3563418 | biostudies-literature
| S-EPMC1177389 | biostudies-other
| S-EPMC4712260 | biostudies-literature
| S-EPMC86212 | biostudies-literature
| S-EPMC8804317 | biostudies-literature
| S-EPMC6819637 | biostudies-literature
| S-EPMC3637798 | biostudies-literature