Genomics

Dataset Information

0

Multiscale representation of genomic signals


ABSTRACT: Genomic information is encoded on a wide range of distance scales, ranging from tens of base pairs to megabases. We developed a multiscale framework to analyze and visualize the information content of genomic signals. Different types of signals, such as GC content or DNA methylation, are characterized by distinct patterns of signal enrichment or depletion across scales spanning several orders of magnitude. These patterns are associated with a variety of genomic annotations, including genes, nuclear lamina associated domains, and repeat elements. By integrating the information across all scales, as compared to using any single scale, we demonstrate improved prediction of gene expression from Polymerase II ChIP-seq measurements and we observed that gene expression differences in colorectal cancer are not most strongly related to gene body methylation, but rather to methylation patterns that extend beyond the single-gene scale.

ORGANISM(S): Mus musculus

PROVIDER: GSE54414 | GEO | 2014/04/13

SECONDARY ACCESSION(S): PRJNA236459

REPOSITORIES: GEO

Similar Datasets

2014-04-13 | E-GEOD-54414 | biostudies-arrayexpress
| PRJNA236459 | ENA
2019-12-03 | GSE114840 | GEO
2024-04-24 | GSE265819 | GEO
2024-04-23 | GSE264334 | GEO
2024-04-23 | GSE264321 | GEO
2024-04-23 | GSE264393 | GEO
| PRJNA789579 | ENA
2007-12-17 | E-MAXD-23 | biostudies-arrayexpress
2019-12-01 | GSE128290 | GEO