Unknown

Dataset Information

0

Local epigenomic state cannot discriminate interacting and non-interacting enhancer-promoter pairs with high accuracy.


ABSTRACT: We report an experimental design issue in recent machine learning formulations of the enhancer-promoter interaction problem arising from the fact that many enhancer-promoter pairs share features. Cross-fold validation schemes which do not correctly separate these feature sharing enhancer-promoter pairs into one test set report high accuracy, which is actually arising from high training set accuracy and a failure to properly evaluate generalization performance. Cross-fold validation schemes which properly segregate pairs with shared features show markedly reduced ability to predict enhancer-promoter interactions from epigenomic state. Parameter scans with multiple models indicate that local epigenomic features of individual pairs of enhancers and promoters cannot distinguish those pairs that interact from those which do with high accuracy, suggesting that additional information is required to predict enhancer-promoter interactions.

SUBMITTER: Xi W 

PROVIDER: S-EPMC6298642 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Local epigenomic state cannot discriminate interacting and non-interacting enhancer-promoter pairs with high accuracy.

Xi Wang W   Beer Michael A MA  

PLoS computational biology 20181218 12


We report an experimental design issue in recent machine learning formulations of the enhancer-promoter interaction problem arising from the fact that many enhancer-promoter pairs share features. Cross-fold validation schemes which do not correctly separate these feature sharing enhancer-promoter pairs into one test set report high accuracy, which is actually arising from high training set accuracy and a failure to properly evaluate generalization performance. Cross-fold validation schemes which  ...[more]

Similar Datasets

| S-EPMC7016741 | biostudies-literature
| S-EPMC5041931 | biostudies-literature
| S-EPMC4393516 | biostudies-literature
| S-EPMC7841892 | biostudies-literature
| S-EPMC6155075 | biostudies-literature
| S-EPMC6838673 | biostudies-literature
| S-EPMC3711120 | biostudies-literature
| S-EPMC7552563 | biostudies-literature
| S-EPMC5052795 | biostudies-literature
2016-03-28 | E-GEOD-46595 | biostudies-arrayexpress