Unknown

Dataset Information

0

A sequence-based global map of regulatory activity for deciphering human genetics.


ABSTRACT: Epigenomic profiling has enabled large-scale identification of regulatory elements, yet we still lack a systematic mapping from any sequence or variant to regulatory activities. We address this challenge with Sei, a framework for integrating human genetics data with sequence information to discover the regulatory basis of traits and diseases. Sei learns a vocabulary of regulatory activities, called sequence classes, using a deep learning model that predicts 21,907 chromatin profiles across >1,300 cell lines and tissues. Sequence classes provide a global classification and quantification of sequence and variant effects based on diverse regulatory activities, such as cell type-specific enhancer functions. These predictions are supported by tissue-specific expression, expression quantitative trait loci and evolutionary constraint data. Furthermore, sequence classes enable characterization of the tissue-specific, regulatory architecture of complex traits and generate mechanistic hypotheses for individual regulatory pathogenic mutations. We provide Sei as a resource to elucidate the regulatory basis of human health and disease.

SUBMITTER: Chen KM 

PROVIDER: S-EPMC9279145 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6344852 | biostudies-other
| S-EPMC10716911 | biostudies-literature
| S-EPMC2974261 | biostudies-literature
2021-06-01 | GSE145753 | GEO
| S-EPMC3629779 | biostudies-literature
| S-EPMC3218825 | biostudies-literature
2014-01-01 | E-GEOD-41222 | biostudies-arrayexpress
| S-EPMC7392335 | biostudies-literature
2014-01-01 | GSE41222 | GEO
| S-EPMC7490824 | biostudies-literature