TY - JOUR
T1 - Posterior sampling strategies based on discretized stochastic differential equations for machine learning applications
AU - Heber, Frederik
AU - Trstanova, Zofia
AU - Leimkuhler, Benedict
PY - 2020/7/31
Y1 - 2020/7/31
N2 - With the advent of GPU-assisted hardware and maturing high-efficiency software platformssuch as TensorFlow and PyTorch, Bayesian posterior sampling for neural networks becomes plausible. In this article we discuss Bayesian parametrization in machine learning based on Markov Chain Monte Carlo methods, specifically discretized stochastic differential equations such as Langevin dynamics and extended system methods in which an ensemble of walkers is employed to enhance sampling. We provide a glimpse of the potential of the sampling-intensive approach by studying (and visualizing) the loss landscape of a neural network applied to the MNIST data set. Moreover, we investigate how the sampling efficiency itself can be significantly enhanced through an ensemble quasi-Newton preconditioning method. This article accompanies the release of a new TensorFlow software package, the ThermodynamicAnalytics ToolkIt, which is used in the computational experiment
AB - With the advent of GPU-assisted hardware and maturing high-efficiency software platformssuch as TensorFlow and PyTorch, Bayesian posterior sampling for neural networks becomes plausible. In this article we discuss Bayesian parametrization in machine learning based on Markov Chain Monte Carlo methods, specifically discretized stochastic differential equations such as Langevin dynamics and extended system methods in which an ensemble of walkers is employed to enhance sampling. We provide a glimpse of the potential of the sampling-intensive approach by studying (and visualizing) the loss landscape of a neural network applied to the MNIST data set. Moreover, we investigate how the sampling efficiency itself can be significantly enhanced through an ensemble quasi-Newton preconditioning method. This article accompanies the release of a new TensorFlow software package, the ThermodynamicAnalytics ToolkIt, which is used in the computational experiment
M3 - Article
SN - 1532-4435
VL - 21
SP - 1
EP - 33
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
ER -