
Weak approximation of transformed stochastic gradient MCMC



Abstract

Stochastic gradient Langevin dynamics (SGLD) is a computationally efficient sampler for Bayesian posterior inference given a large-scale dataset and a complex model. Although SGLD is designed for unbounded random variables, practical models often involve variables restricted to a bounded domain, such as non-negative values or a finite interval. Variable transformation is a typical way to handle such bounded variables. This paper reveals, from both theoretical and empirical perspectives, that several mapping approaches commonly used in the literature produce erroneous samples. We show that performing the change of random variable in the discretization using an invertible Lipschitz mapping function avoids this pitfall and attains weak convergence, whereas the other methods are either numerically unstable or cannot be justified theoretically. Experiments demonstrate its efficacy for widely used models with bounded latent variables, including Bayesian non-negative matrix factorization and binary neural networks.
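To illustrate the idea of sampling a bounded variable via an invertible Lipschitz transformation, here is a minimal sketch (not the paper's exact algorithm): a non-negative target, taken for concreteness to be a Gamma(2, 1) density, is reparametrized through the softplus map, and plain SGLD (here, full-gradient Langevin dynamics, since the toy target has no minibatching) runs on the unbounded variable against the pushforward density, which includes the log-Jacobian term. The step size and iteration count are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

def softplus(t):
    # Invertible Lipschitz map from R onto (0, inf).
    return np.log1p(np.exp(t))

def grad_log_push(t, a=2.0, b=1.0):
    # Toy target: Gamma(a, b) density on x > 0.
    # Under the change of variable x = softplus(t), the density of t is
    # p(softplus(t)) * softplus'(t); this returns d/dt of its log.
    x = softplus(t)
    s = 1.0 / (1.0 + np.exp(-t))   # softplus'(t) = sigmoid(t)
    dlogp_dx = (a - 1.0) / x - b   # d/dx log Gamma(a, b) pdf
    dlogjac_dt = 1.0 - s           # d/dt log sigmoid(t)
    return dlogp_dx * s + dlogjac_dt

def sgld_transformed(n_iter=20000, eps=1e-2):
    # Langevin update on the unbounded variable t; samples are
    # mapped back through softplus, so every sample is valid (x > 0).
    t = 0.0
    samples = []
    for _ in range(n_iter):
        t += 0.5 * eps * grad_log_push(t) + np.sqrt(eps) * rng.normal()
        samples.append(softplus(t))
    return np.array(samples)

xs = sgld_transformed()
print(xs.mean())  # Gamma(2, 1) has mean 2; expect roughly that, up to discretization bias
```

Because the update runs entirely in the unbounded space and the map is applied only when reading out samples, no iterate can leave the valid domain, in contrast with naive clipping or reflection schemes.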

Bibliographic information

  • Source
    Machine Learning, 2020, Issue 10, pp. 1903-1923 (21 pages)
  • Author affiliations

    Univ Tokyo, Grad Sch Frontier Sci, Dept Complex Sci & Engn, 5-1-5 Kashiwanoha, Kashiwa, Chiba 2778561, Japan | RIKEN, Chuo Ku, 1-4-1 Nihonbashi, Tokyo 1030027, Japan;

    NTT Corp, NTT Commun Sci Labs, 2-4 Hikaridai, Seika Cho, Kyoto 6190237, Japan;

    RIKEN, Chuo Ku, 1-4-1 Nihonbashi, Tokyo 1030027, Japan | Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1130033, Japan;

  • Indexing information
  • Format: PDF
  • Language: English
  • CLC classification
  • Keywords

    Stochastic gradient MCMC; Transform; Convergence analysis; Ito process;
