Unknown

Dataset Information

0

A Flexible Framework for Nonparametric Graphical Modeling that Accommodates Machine Learning.


ABSTRACT: Graphical modeling has been broadly useful for exploring the dependence structure among features in a dataset. However, the strength of graphical modeling hinges on our ability to encode and estimate conditional dependencies. In particular, commonly used measures such as partial correlation are only meaningful under strongly parametric (in this case, multivariate Gaussian) assumptions. These assumptions are unverifiable, and there is often little reason to believe they hold in practice. In this paper, we instead consider 3 nonparametric measures of conditional dependence. These measures are meaningful without structural assumptions on the multivariate distribution of the data. In addition, we show that for 2 of these measures there are simple, strong plug-in estimators that require only the estimation of a conditional mean. These plug-in estimators (1) are asymptotically linear and non-parametrically efficient, (2) allow incorporation of flexible machine learning techniques for conditional mean estimation, and (3) enable the construction of valid Wald-type confidence intervals. In addition, by leveraging the influence function of these estimators, one can obtain intervals with simultaneous coverage guarantees for all pairs of features.

SUBMITTER: Xiang Y 

PROVIDER: S-EPMC7787692 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Flexible Framework for Nonparametric Graphical Modeling that Accommodates Machine Learning.

Xiang Yunhua Y   Simon Noah N  

Proceedings of machine learning research 20200701


Graphical modeling has been broadly useful for exploring the dependence structure among features in a dataset. However, the strength of graphical modeling hinges on our ability to encode and estimate conditional dependencies. In particular, commonly used measures such as partial correlation are only meaningful under strongly parametric (in this case, multivariate Gaussian) assumptions. These assumptions are unverifiable, and there is often little reason to believe they hold in practice. In this  ...[more]

Similar Datasets

| S-EPMC8017690 | biostudies-literature
| S-EPMC3158423 | biostudies-literature
| S-EPMC7946807 | biostudies-literature
| S-EPMC5545857 | biostudies-other
| S-EPMC8716047 | biostudies-literature
| S-EPMC7455482 | biostudies-literature
| S-EPMC6500604 | biostudies-other
| S-EPMC4552841 | biostudies-literature
| S-EPMC7937228 | biostudies-literature
2023-01-16 | GSE183256 | GEO