...
首页> 外文期刊>IEEE Transactions on Computers >Population-Based MCMC on Multi-Core CPUs, GPUs and FPGAs
【24h】

Population-Based MCMC on Multi-Core CPUs, GPUs and FPGAs

机译:基于人口的基于MCMC的多核CPU,GPU和FPGA

获取原文
获取原文并翻译 | 示例
           

摘要

Markov Chain Monte Carlo (MCMC) is a method to draw samples from a given probability distribution. Its frequent use for solving probabilistic inference problems, where big-scale data are repeatedly processed, means that MCMC runtimes can be unacceptably large. This paper focuses on population-based MCMC, a popular family of computationally intensive MCMC samplers; we propose novel, highly optimized accelerators in three parallel hardware platforms (multi-core CPUs, GPUs and FPGAs), in order to address the performance limitations of sequential software implementations. For each platform, we jointly exploit the nature of the underlying hardware and the special characteristics of population-based MCMC. We focus particularly on the use of custom arithmetic precision, introducing two novel methods which employ custom precision in the largest part of the algorithm in order to reduce runtime, without causing sampling errors. We apply these methods to all platforms. The FPGA accelerators are up to 114x faster than multi-core CPUs and up to 53x faster than GPUs when doing inference on mixture models.
机译:马尔可夫链蒙特卡洛(MCMC)是一种从给定的概率分布中抽取样本的方法。它经常用于解决概率推理问题,在该问题中重复处理大规模数据,这意味着MCMC运行时可能会过大。本文着重介绍基于人群的MCMC,这是一个受欢迎的计算密集型MCMC采样器系列。我们提出了在三个并行硬件平台(多核CPU,GPU和FPGA)中高度优化的新颖加速器,以解决顺序软件实现的性能限制。对于每个平台,我们共同利用基础硬件的性质以及基于人群的MCMC的特殊特性。我们特别关注自定义算术精度的使用,介绍了两种新颖的方法,它们在算法的最大部分采用了自定义精度,以减少运行时间,而不会引起采样错误。我们将这些方法应用于所有平台。在对混合模型进行推论时,FPGA加速器的速度比多核CPU快114倍,比GPU快53倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号