Unknown

Dataset Information

0

GOTHiC, a probabilistic model to resolve complex biases and to identify real interactions in Hi-C data.


ABSTRACT: Hi-C is one of the main methods for investigating spatial co-localisation of DNA in the nucleus. However, the raw sequencing data obtained from Hi-C experiments suffer from large biases and spurious contacts, making it difficult to identify true interactions. Existing methods use complex models to account for biases and do not provide a significance threshold for detecting interactions. Here we introduce a simple binomial probabilistic model that resolves complex biases and distinguishes between true and false interactions. The model corrects biases of known and unknown origin and yields a p-value for each interaction, providing a reliable threshold based on significance. We demonstrate this experimentally by testing the method against a random ligation dataset. Our method outperforms previous methods and provides a statistical framework for further data analysis, such as comparisons of Hi-C interactions between different conditions. GOTHiC is available as a BioConductor package (http://www.bioconductor.org/packages/release/bioc/html/GOTHiC.html).

SUBMITTER: Mifsud B 

PROVIDER: S-EPMC5381888 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

GOTHiC, a probabilistic model to resolve complex biases and to identify real interactions in Hi-C data.

Mifsud Borbala B   Martincorena Inigo I   Darbo Elodie E   Sugar Robert R   Schoenfelder Stefan S   Fraser Peter P   Luscombe Nicholas M NM  

PloS one 20170405 4


Hi-C is one of the main methods for investigating spatial co-localisation of DNA in the nucleus. However, the raw sequencing data obtained from Hi-C experiments suffer from large biases and spurious contacts, making it difficult to identify true interactions. Existing methods use complex models to account for biases and do not provide a significance threshold for detecting interactions. Here we introduce a simple binomial probabilistic model that resolves complex biases and distinguishes between  ...[more]

Similar Datasets

| S-EPMC9272800 | biostudies-literature
| S-EPMC5740158 | biostudies-literature
| S-EPMC3509491 | biostudies-literature
| S-EPMC6391755 | biostudies-literature
| S-EPMC9262194 | biostudies-literature
| S-EPMC5442359 | biostudies-literature
| S-EPMC5860379 | biostudies-literature
| S-EPMC8574949 | biostudies-literature
| S-EPMC4853436 | biostudies-literature
| S-EPMC11367764 | biostudies-literature