Publication | Open Access
Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning
51
Citations
34
References
2020
Year
Artificial IntelligenceConvolutional Neural NetworkEngineeringMachine LearningSequential LearningMultimodal LearningMarkov Chain Monte CarloBayesian Deep LearningRecurrent Neural NetworkBayesian InferenceData ScienceSparse Neural NetworkRobot LearningBayesian Hierarchical ModelingCyclical Sg-mcmcNeural Network WeightsComputer ScienceDeep LearningNew Modes
The posteriors over neural network weights are high dimensional and multimodal. Each mode typically characterizes a meaningfully different representation of the data. We develop Cyclical Stochastic Gradient MCMC (SG-MCMC) to automatically explore such distributions. In particular, we propose a cyclical stepsize schedule, where larger steps discover new modes, and smaller steps characterize each mode. We prove non-asymptotic convergence theory of our proposed algorithm. Moreover, we provide extensive experimental results, including ImageNet, to demonstrate the effectiveness of cyclical SG-MCMC in learning complex multimodal distributions, especially for fully Bayesian inference with modern deep neural networks.
| Year | Citations | |
|---|---|---|
Page 1
Page 1