Unknown

Dataset Information

0

Accurate prediction of boundaries of high resolution topologically associated domains (TADs) in fruit flies using deep learning.


ABSTRACT: Genomes are organized into self-interacting chromatin regions called topologically associated domains (TADs). A significant number of TAD boundaries are shared across multiple cell types and conserved across species. Disruption of TAD boundaries may affect the expression of nearby genes and could lead to several diseases. Even though detection of TAD boundaries is important and useful, there are experimental challenges in obtaining high resolution TAD locations. Here, we present computational prediction of TAD boundaries from high resolution Hi-C data in fruit flies. By extensive exploration and testing of several deep learning model architectures with hyperparameter optimization, we show that a unique deep learning model consisting of three convolution layers followed by a long short-term-memory layer achieves an accuracy of 96%. This outperforms feature-based models' accuracy of 91% and an existing method's accuracy of 73-78% based on motif TRAP scores. Our method also detects previously reported motifs such as Beaf-32 that are enriched in TAD boundaries in fruit flies and also several unreported motifs.

SUBMITTER: Henderson J 

PROVIDER: S-EPMC6648328 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate prediction of boundaries of high resolution topologically associated domains (TADs) in fruit flies using deep learning.

Henderson John J   Ly Vi V   Olichwier Shawn S   Chainani Pranik P   Liu Yu Y   Soibam Benjamin B  

Nucleic acids research 20190701 13


Genomes are organized into self-interacting chromatin regions called topologically associated domains (TADs). A significant number of TAD boundaries are shared across multiple cell types and conserved across species. Disruption of TAD boundaries may affect the expression of nearby genes and could lead to several diseases. Even though detection of TAD boundaries is important and useful, there are experimental challenges in obtaining high resolution TAD locations. Here, we present computational pr  ...[more]

Similar Datasets

| S-EPMC5768762 | biostudies-literature
2022-09-16 | GSE188940 | GEO
| S-EPMC7160146 | biostudies-literature
2017-06-26 | GSE97965 | GEO
| S-EPMC5870651 | biostudies-literature
| PRJNA398162 | ENA
| S-EPMC6093941 | biostudies-literature
| S-EPMC7422557 | biostudies-literature
| S-EPMC6918570 | biostudies-literature
| S-EPMC3045854 | biostudies-literature