...
首页> 外文期刊>Neural processing letters >Low-Dimensional Perturb-and-MAP Approach for Learning Restricted Boltzmann Machines
【24h】

Low-Dimensional Perturb-and-MAP Approach for Learning Restricted Boltzmann Machines

机译:用于学习受限制的Boltzmann机器的低维扰动和地图方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper introduces a new approach to maximum likelihood learning of the parameters of a restricted Boltzmann machine (RBM). The proposed method is based on the Perturb-and-MAP (PM) paradigm that enables sampling from the Gibbs distribution. PM is a two step process: (i) perturb the model using Gumbel perturbations, then (ii) find the maximum a posteriori (MAP) assignment of the perturbed model. We show that under certain conditions the resulting MAP configuration of the perturbed model is an unbiased sample from the original distribution. However, this approach requires an exponential number of perturbations, which is computationally intractable. Here, we apply an approximate approach based on the first order (low-dimensional) PM to calculate the gradient of the log-likelihood in binary RBM. Our approach relies on optimizing the energy function with respect to observable and hidden variables using a greedy procedure. First, for each variable we determine whether flipping this value will decrease the energy, and then we utilize the new local maximum to approximate the gradient. Moreover, we show that in some cases our approach works better than the standard coordinate-descent procedure for finding the MAP assignment and compare it with the Contrastive Divergence algorithm. We investigate the quality of our approach empirically, first on toy problems, then on various image datasets and a text dataset.
机译:本文介绍了一种最大似然学习的最大似然学习的新方法(RBM)。所提出的方法基于扰动和地图(PM)范式,使得能够从GIBBS分布中采样。 PM是一个两个步骤过程:(i)使用gumbel扰动扰乱模型,然后(ii)找到扰动模型的最大后验(map)分配。我们表明,在某些条件下,扰动模型的结果地图配置是来自原始分布的无偏析的样本。然而,这种方法需要指数扰动,这是计算地棘手的。这里,我们基于第一阶(低维)PM来应用近似方法,以计算二进制RBM中的日志似然的梯度。我们的方法依赖于使用贪婪程序对可观察和隐藏变量的优化能量函数。首先,对于每个变量,我们确定翻转此值是否会降低能量,然后我们利用新的局部最大值来近似梯度。此外,我们表明,在某些情况下,我们的方法优于找到地图分配的标准坐标缩减程序,并将其与对比分解算法进行比较。我们经验上调查了我们的方法的质量,首先在玩具问题上,然后在各种图像数据集和文本数据集上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号