Unknown

Dataset Information

0

Meffil: efficient normalization and analysis of very large DNA methylation datasets.


ABSTRACT:

Motivation

DNA methylation datasets are growing ever larger both in sample size and genome coverage. Novel computational solutions are required to efficiently handle these data.

Results

We have developed meffil, an R package designed for efficient quality control, normalization and epigenome-wide association studies of large samples of Illumina Methylation BeadChip microarrays. A complete re-implementation of functional normalization minimizes computational memory without increasing running time. Incorporating fixed and random effects within functional normalization, and automated estimation of functional normalization parameters reduces technical variation in DNA methylation levels, thus reducing false positive rates and improving power. Support for normalization of datasets distributed across physically different locations without needing to share biologically-based individual-level data means that meffil can be used to reduce heterogeneity in meta-analyses of epigenome-wide association studies.

Availability and implementation

https://github.com/perishky/meffil/.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Min JL 

PROVIDER: S-EPMC6247925 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6419913 | biostudies-literature
| S-EPMC5870771 | biostudies-literature
| S-EPMC2557142 | biostudies-literature
| S-EPMC11003185 | biostudies-literature
| S-EPMC3106185 | biostudies-literature
| S-EPMC9410889 | biostudies-literature
| S-EPMC4138177 | biostudies-literature
| S-EPMC5343294 | biostudies-literature
| S-EPMC5394940 | biostudies-literature
| S-EPMC2865495 | biostudies-literature