Unknown

Dataset Information

0

A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data.


ABSTRACT: MOTIVATION:Enhancer-promoter interactions (EPIs) in the genome play an important role in transcriptional regulation. EPIs can be useful in boosting statistical power and enhancing mechanistic interpretation for disease- or trait-associated genetic variants in genome-wide association studies. Instead of expensive and time-consuming biological experiments, computational prediction of EPIs with DNA sequence and other genomic data is a fast and viable alternative. In particular, deep learning and other machine learning methods have been demonstrated with promising performance. RESULTS:First, using a published human cell line dataset, we demonstrate that a simple convolutional neural network (CNN) performs as well as, if no better than, a more complicated and state-of-the-art architecture, a hybrid of a CNN and a recurrent neural network. More importantly, in spite of the well-known cell line-specific EPIs (and corresponding gene expression), in contrast to the standard practice of training and predicting for each cell line separately, we propose two transfer learning approaches to training a model using all cell lines to various extents, leading to substantially improved predictive performance. AVAILABILITY AND IMPLEMENTATION:Computer code is available at https://github.com/zzUMN/Combine-CNN-Enhancer-and-Promoters. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Zhuang Z 

PROVIDER: S-EPMC6735851 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data.

Zhuang Zhong Z   Shen Xiaotong X   Pan Wei W  

Bioinformatics (Oxford, England) 20190901 17


<h4>Motivation</h4>Enhancer-promoter interactions (EPIs) in the genome play an important role in transcriptional regulation. EPIs can be useful in boosting statistical power and enhancing mechanistic interpretation for disease- or trait-associated genetic variants in genome-wide association studies. Instead of expensive and time-consuming biological experiments, computational prediction of EPIs with DNA sequence and other genomic data is a fast and viable alternative. In particular, deep learnin  ...[more]

Similar Datasets

| S-EPMC6929276 | biostudies-literature
| S-EPMC7648314 | biostudies-literature
| S-EPMC7016741 | biostudies-literature
| S-EPMC11323586 | biostudies-literature
| S-EPMC6961579 | biostudies-literature
| S-EPMC8501998 | biostudies-literature
| S-EPMC7013409 | biostudies-literature
| S-EPMC5954283 | biostudies-literature
| S-EPMC5870728 | biostudies-literature
| S-EPMC7054402 | biostudies-literature