Abstract
We provide a nonasymptotic analysis of the convergence of stochastic gradient Hamiltonian Monte Carlo (SGHMC) to a target measure in Wasserstein-2 distance without assuming log-concavity. By making the dimension dependence explicit, we provide a uniform convergence rate of order $\mathcal{O}(\eta^{1/4})$, where $\eta$ is the step-size. Our results shed light on the performance of SGHMC methods compared to their overdamped counterparts, e.g., stochastic gradient Langevin dynamics (SGLD). Furthermore, our results also imply that SGHMC, when viewed as a nonconvex optimizer, converges to a global minimum with the best known rates.
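For orientation, below is a minimal sketch of one common SGHMC discretization (an Euler-type update of kinetic Langevin dynamics with stochastic gradients). The friction `gamma`, inverse temperature `beta`, the toy Gaussian target, and the noisy gradient oracle are illustrative assumptions for this sketch, not details taken from the paper.

```python
# Minimal SGHMC sketch: position/momentum updates with step-size eta,
# friction gamma, and a stochastic (minibatch-style) gradient estimate.
import numpy as np

def sghmc(grad_estimate, theta0, eta=1e-3, gamma=1.0, beta=1.0,
          n_iters=10_000, rng=None):
    """Run SGHMC and return the trajectory of position samples."""
    rng = np.random.default_rng() if rng is None else rng
    theta = np.asarray(theta0, dtype=float)
    v = np.zeros_like(theta)                       # momentum variable
    noise_scale = np.sqrt(2.0 * gamma * eta / beta)
    samples = []
    for _ in range(n_iters):
        # position update driven by the momentum
        theta = theta + eta * v
        # momentum update: friction, stochastic gradient, injected Gaussian noise
        v = (v - eta * gamma * v - eta * grad_estimate(theta)
             + noise_scale * rng.standard_normal(theta.shape))
        samples.append(theta.copy())
    return np.array(samples)

if __name__ == "__main__":
    # Toy usage: a standard Gaussian target, with a noisy gradient oracle
    # standing in for a minibatch gradient estimate.
    rng = np.random.default_rng(0)
    noisy_grad = lambda th: th + 0.1 * rng.standard_normal(th.shape)
    traj = sghmc(noisy_grad, theta0=np.zeros(2), eta=1e-2, n_iters=5000)
    print(traj[-1000:].mean(axis=0), traj[-1000:].std(axis=0))
```

Setting the step-size `eta` small trades faster mixing for discretization bias, which is the quantity the paper's $\mathcal{O}(\eta^{1/4})$ Wasserstein-2 bound controls.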
| Original language | English |
| --- | --- |
| Pages (from-to) | 1-34 |
| Journal | Journal of Machine Learning Research |
| Volume | 25 |
| Issue number | 113 |
| Publication status | Published - 31 Jan 2024 |
Keywords
- math.OC
- stat.CO
- stat.ML