Unknown

Dataset Information

0

Variance Reduction in Stochastic Gradient Langevin Dynamics.


ABSTRACT: Stochastic gradient-based Monte Carlo methods such as stochastic gradient Langevin dynamics are useful tools for posterior inference on large scale datasets in many machine learning applications. These methods scale to large datasets by using noisy gradients calculated using a mini-batch or subset of the dataset. However, the high variance inherent in these noisy gradients degrades performance and leads to slower mixing. In this paper, we present techniques for reducing variance in stochastic gradient Langevin dynamics, yielding novel stochastic Monte Carlo methods that improve performance by reducing the variance in the stochastic gradient. We show that our proposed method has better theoretical guarantees on convergence rate than stochastic Langevin dynamics. This is complemented by impressive empirical results obtained on a variety of real world datasets, and on four different machine learning tasks (regression, classification, independent component analysis and mixture modeling). These theoretical and empirical contributions combine to make a compelling case for using variance reduction in stochastic Monte Carlo methods.

SUBMITTER: Dubey A 

PROVIDER: S-EPMC5508544 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Variance Reduction in Stochastic Gradient Langevin Dynamics.

Dubey Avinava A   Reddi Sashank J SJ   Póczos Barnabás B   Smola Alexander J AJ   Xing Eric P EP   Williamson Sinead A SA  

Advances in neural information processing systems 20161201


Stochastic gradient-based Monte Carlo methods such as stochastic gradient Langevin dynamics are useful tools for posterior inference on large scale datasets in many machine learning applications. These methods scale to large datasets by using noisy gradients calculated using a mini-batch or subset of the dataset. However, the high variance inherent in these noisy gradients degrades performance and leads to slower mixing. In this paper, we present techniques for reducing variance in stochastic gr  ...[more]

Similar Datasets

| S-EPMC8457681 | biostudies-literature
| S-EPMC6990154 | biostudies-literature
| S-EPMC7936325 | biostudies-literature
| S-EPMC8173568 | biostudies-literature
| S-EPMC2915568 | biostudies-literature
| S-EPMC8580443 | biostudies-literature
| S-EPMC2673189 | biostudies-literature
| S-EPMC4308582 | biostudies-literature
| S-EPMC2632580 | biostudies-literature
| S-EPMC2914651 | biostudies-literature