CPSG-MCMC: Clustering-Based Preprocessing method for Stochastic Gradient MCMC

Tianfan Fu; Zhihua Zhang

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >CPSG-MCMC: Clustering-Based Preprocessing method for Stochastic Gradient MCMC

【24h】

CPSG-MCMC: Clustering-Based Preprocessing method for Stochastic Gradient MCMC

机译：CPSG-MCMC：随机梯度MCMC的基于聚类的预处理方法

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, stochastic gradient Markov Chain Monte Carlo (SG-MCMC) methods have been raised to process large-scale dataset by iterative learning from small minibatches. However, the high variance caused by naive subsampling usually slows down the convergence to the desired posterior distribution. In this paper, we propose an effective subsampling strategy to reduce the variance based on a failed attempt to do importance sampling. In particular, before sampling, we partition the dataset with k-means clustering algorithm in a preprocessing step and use the fixed clustering throughout the entire MCMC simulation. Then during simulation, we approximate the gradient of log-posterior via summing the estimated gradient of each cluster. The resulting procedure is surprisingly simple without enhancing the complexity of the original algorithm during sampling procedure. We apply our Clustering-based Preprocessing strategy on stochastic gradient Langevin dynamics, stochastic gradient Hamilton Monte Carlo and stochastic gradient Riemann Langevin dynamics. Empirically, we provide thorough numerical results to back up the effectiveness and efficiency of our approach.

机译：近年来，已经提出了从小批处理中迭代学习的随机梯度马尔可夫链蒙特卡洛（SG-MCMC）方法来处理大规模数据集。但是，由朴素的子采样引起的高方差通常会降低收敛到所需后验分布的速度。在本文中，我们提出了一种有效的子采样策略，以减少基于重要性采样失败的方差。特别是，在采样之前，我们在预处理步骤中使用k-均值聚类算法对数据集进行分区，并在整个MCMC仿真中使用固定聚类。然后在仿真过程中，我们通过对每个聚类的估计梯度求和来近似对数后梯度。所得过程非常简单，却没有增加采样过程中原始算法的复杂性。我们将基于聚类的预处理策略应用于随机梯度Langevin动力学，随机梯度Hamilton Monte Carlo和随机梯度Riemann Langevin动力学。从经验上讲，我们提供了详尽的数值结果来支持我们方法的有效性和效率。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2017年第2009期|共10页
作者
Tianfan Fu; Zhihua Zhang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Stochastic Gradient MCMC Methods for Hidden Markov Models [J] . Yi-An Ma, Nicholas J. Foti, Emily B. Fox JMLR: Workshop and Conference Proceedings . 2017,第2007期

机译：隐马尔可夫模型的随机梯度MCMC方法
2. Clustering-based preprocessing method for lipidomic data analysis: application for the evolution of newborn skin surface lipids from birth until 6?months [J] . Rime Michael-Jubeli, Ali Tfayli, Caroline Baudouin, Analytical and bioanalytical chemistry . 2018,第25期

机译：基于聚类的脂质数据分析的预处理方法：施用新生儿皮肤表面脂质从出生中的进化到6？数月
3. Weak approximation of transformed stochastic gradient MCMC [J] . Yokoi Soma, Otsuka Takuma, Sato Issei Machine Learning . 2020,第9a10期

机译：变换随机梯度MCMC的弱近似
4. Stochastic Gradient MCMC Methods for Hidden Markov Models [C] . Yi-An Ma, Nicholas J. Foti, Emily B. Fox International Conference on Machine Learning . 2018

机译：隐藏马尔可夫模型的随机梯度MCMC方法
5. Clustering-Enhanced Stochastic Gradient MCMC for Hidden Markov Models [D] . Ou, Rihui. 2019

机译：隐马尔可夫模型的聚类增强随机梯度MCMC
6. Gradient-free MCMC methods for dynamic causal modelling [O] . Biswa Sengupta, Karl J. Friston, Will D. Penny -1

机译：动态因果建模的无梯度MCMC方法
7. An adaptive Hessian approximated stochastic gradient MCMC method [O] . Yating Wang, Wei Deng, Guang Lin 2021

机译：自适应黑森州近似随机梯度MCMC方法

CPSG-MCMC: Clustering-Based Preprocessing method for Stochastic Gradient MCMC

摘要

著录项

相似文献

相关主题

期刊订阅