Unknown

Dataset Information

0

DeFine: deep convolutional neural networks accurately quantify intensities of transcription factor-DNA binding and facilitate evaluation of functional non-coding variants.


ABSTRACT: The complex system of gene expression is regulated by the cell type-specific binding of transcription factors (TFs) to regulatory elements. Identifying variants that disrupt TF binding and lead to human diseases remains a great challenge. To address this, we implement sequence-based deep learning models that accurately predict the TF binding intensities to given DNA sequences. In addition to accurately classifying TF-DNA binding or unbinding, our models are capable of accurately predicting real-valued TF binding intensities by leveraging large-scale TF ChIP-seq data. The changes in the TF binding intensities between the altered sequence and the reference sequence reflect the degree of functional impact for the variant. This enables us to develop the tool DeFine (Deep learning based Functional impact of non-coding variants evaluator, http://define.cbi.pku.edu.cn) with improved performance for assessing the functional impact of non-coding variants including SNPs and indels. DeFine accurately identifies the causal functional non-coding variants from disease-associated variants in GWAS. DeFine is an effective and easy-to-use tool that facilities systematic prioritization of functional non-coding variants.

SUBMITTER: Wang M 

PROVIDER: S-EPMC6009584 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

DeFine: deep convolutional neural networks accurately quantify intensities of transcription factor-DNA binding and facilitate evaluation of functional non-coding variants.

Wang Meng M   Tai Cheng C   E Weinan W   Wei Liping L  

Nucleic acids research 20180601 11


The complex system of gene expression is regulated by the cell type-specific binding of transcription factors (TFs) to regulatory elements. Identifying variants that disrupt TF binding and lead to human diseases remains a great challenge. To address this, we implement sequence-based deep learning models that accurately predict the TF binding intensities to given DNA sequences. In addition to accurately classifying TF-DNA binding or unbinding, our models are capable of accurately predicting real-  ...[more]

Similar Datasets

| S-EPMC6880932 | biostudies-literature
| S-EPMC7755594 | biostudies-literature
| S-EPMC6010233 | biostudies-other
| S-EPMC5773911 | biostudies-literature
| S-EPMC7649013 | biostudies-literature
| S-EPMC5552800 | biostudies-other
| S-EPMC6925141 | biostudies-literature
| S-EPMC5808454 | biostudies-literature
| S-EPMC5519034 | biostudies-literature
| S-EPMC7924482 | biostudies-literature