Dataset Information

S3norm: simultaneous normalization of sequencing depth and signal-to-noise ratio in epigenomic data.


ABSTRACT: Quantitative comparison of epigenomic data across multiple cell types or experimental conditions is a promising way to understand the biological functions of epigenetic modifications. However, differences in sequencing depth and signal-to-noise ratios in the data from different experiments can hinder our ability to identify real biological variation from raw epigenomic data. Proper normalization is required prior to data analysis to gain meaningful insights. Most existing methods for data normalization standardize signals by rescaling either background regions or peak regions, assuming that the same scale factor is applicable to both background and peak regions. While such methods adjust for differences in sequencing depths, they do not address differences in the signal-to-noise ratios across different experiments. We developed a new data normalization method, called S3norm, that normalizes the sequencing depths and signal-to-noise ratios across different data sets simultaneously by a monotonic nonlinear transformation. We show empirically that the epigenomic data normalized by our method, compared to existing methods, can better capture real biological variation, such as impact on gene expression regulation.
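The abstract describes a monotonic nonlinear transformation that matches both sequencing depth and signal-to-noise ratio to a reference data set, but it does not state the functional form. The sketch below assumes one simple form, a power law y = alpha * x^beta, and fits alpha and beta so that the mean signal in peak regions and the mean signal in background regions both equal the reference means. The function names, the bisection search range, and the toy values are all hypothetical and chosen for illustration; this is not presented as the authors' implementation.

```python
def mean_pow(values, beta):
    """Mean of values raised to the power beta (values must be positive)."""
    return sum(v ** beta for v in values) / len(values)


def fit_power_transform(peak_vals, bg_vals, ref_peak_mean, ref_bg_mean,
                        lo=0.01, hi=10.0, iters=200):
    """Fit alpha, beta so that alpha * x**beta matches the reference means
    in both peak and background regions (assumed power-law form).

    beta is found by bisection on the peak/background ratio, which does not
    depend on alpha; alpha then rescales the background mean.
    """
    ref_ratio = ref_peak_mean / ref_bg_mean

    def gap(beta):
        return mean_pow(peak_vals, beta) / mean_pow(bg_vals, beta) - ref_ratio

    if gap(lo) > 0 or gap(hi) < 0:
        raise ValueError("beta is not bracketed by [lo, hi]")

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0:
            hi = mid
        else:
            lo = mid

    beta = 0.5 * (lo + hi)
    alpha = ref_bg_mean / mean_pow(bg_vals, beta)
    return alpha, beta


def apply_transform(values, alpha, beta):
    """Apply the fitted transform; monotonic increasing for alpha, beta > 0."""
    return [alpha * v ** beta for v in values]


# Toy example: the reference is built as 2 * x**1.5 from the target signal,
# so the fit should recover alpha close to 2 and beta close to 1.5.
bg = [1.0, 2.0, 3.0, 2.0, 1.0, 2.0]      # hypothetical background signal
peak = [20.0, 30.0, 25.0]                # hypothetical peak signal
ref_bg_mean = mean_pow(apply_transform(bg, 2.0, 1.5), 1.0)
ref_peak_mean = mean_pow(apply_transform(peak, 2.0, 1.5), 1.0)

alpha, beta = fit_power_transform(peak, bg, ref_peak_mean, ref_bg_mean)
normalized_peak = apply_transform(peak, alpha, beta)
```

A single scale factor (beta fixed at 1) can match only one of the two means. The second parameter lets the peak-to-background ratio be adjusted as well, which is the distinction the abstract draws against methods that correct for sequencing depth alone. In practice a small pseudocount would be needed before applying the power so that zero-count bins stay positive.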

SUBMITTER: Xiang G 

PROVIDER: S-EPMC7192629 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

Publications

S3norm: simultaneous normalization of sequencing depth and signal-to-noise ratio in epigenomic data.

Xiang G, Keller CA, Giardine B, An L, Li Q, Zhang Y, Hardison RC

Nucleic Acids Research, 2020 May 1, issue 8


Similar Datasets

| S-EPMC3963315 | biostudies-literature
| S-EPMC9177987 | biostudies-literature
| S-EPMC6051708 | biostudies-literature
| S-EPMC10128961 | biostudies-literature
| S-EPMC2394959 | biostudies-literature
| S-EPMC3039683 | biostudies-literature
| S-EPMC10174484 | biostudies-literature
| S-EPMC5532082 | biostudies-literature
| S-EPMC1464087 | biostudies-literature
| S-EPMC6303234 | biostudies-literature