Unknown

Dataset Information

0

EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps.


ABSTRACT: The Hi-C technique has been shown to be a promising method to detect structural variations (SVs) in human genomes. However, algorithms that can use Hi-C data for a full-range SV detection have been severely lacking. Current methods can only identify interchromosomal translocations and long-range intrachromosomal SVs (>1 Mb) at less-than-optimal resolution. Therefore, we develop EagleC, a framework that combines deep-learning and ensemble-learning strategies to predict a full range of SVs at high resolution. We show that EagleC can uniquely capture a set of fusion genes that are missed by whole-genome sequencing or nanopore. Furthermore, EagleC also effectively captures SVs in other chromatin interaction platforms, such as HiChIP, Chromatin interaction analysis with paired-end tag sequencing (ChIA-PET), and capture Hi-C. We apply EagleC in more than 100 cancer cell lines and primary tumors and identify a valuable set of high-quality SVs. Last, we demonstrate that EagleC can be applied to single-cell Hi-C and used to study the SV heterogeneity in primary tumors.

SUBMITTER: Wang X 

PROVIDER: S-EPMC9200291 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps.

Wang Xiaotao X   Luan Yu Y   Yue Feng F  

Science advances 20220615 24


The Hi-C technique has been shown to be a promising method to detect structural variations (SVs) in human genomes. However, algorithms that can use Hi-C data for a full-range SV detection have been severely lacking. Current methods can only identify interchromosomal translocations and long-range intrachromosomal SVs (>1 Mb) at less-than-optimal resolution. Therefore, we develop EagleC, a framework that combines deep-learning and ensemble-learning strategies to predict a full range of SVs at high  ...[more]

Similar Datasets

| S-EPMC6818797 | biostudies-literature
| S-EPMC8684312 | biostudies-literature
| S-EPMC3553935 | biostudies-literature
| S-EPMC7347923 | biostudies-literature
| S-EPMC8382278 | biostudies-literature
| S-EPMC7320627 | biostudies-literature
| S-EPMC5297918 | biostudies-literature
| S-EPMC11439270 | biostudies-literature
| S-EPMC11633756 | biostudies-literature
| S-EPMC8574626 | biostudies-literature