Bayes and big data: the consensus Monte Carlo algorithm

Steven L. Scott; Alexander W. Blocker; Fernando V. Bonassi; Hugh A. Chipman; Edward I. George; Robert E. McCulloch

Abstract

A useful definition of 'big data' is data that is too big to process comfortably on a single machine because of processor, memory, or disk bottlenecks. Graphics processing units can alleviate the processor bottleneck, but memory or disk bottlenecks can only be eliminated by splitting data across multiple machines. Communication between large numbers of machines is expensive (regardless of the amount of data being communicated), so there is a need for algorithms that perform distributed approximate Bayesian analyses with minimal communication. Consensus Monte Carlo operates by running a separate Monte Carlo algorithm on each machine, and then averaging individual Monte Carlo draws across machines. Depending on the model, the resulting draws can be nearly indistinguishable from the draws that would have been obtained by running a single-machine algorithm for a very long time. Examples of consensus Monte Carlo are shown for simple models where single-machine solutions are available, for large single-layer hierarchical models, and for Bayesian additive regression trees (BART).
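
The averaging step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy Gaussian-mean model with known variance, uses exact conjugate draws as a stand-in for each machine's Monte Carlo run, gives each shard the prior raised to the power 1/S, and averages with inverse-variance weights. The weighting scheme, the prior split, and all names (`shard_posterior_draws`, `consensus_draws`, `sigma`, `tau`) are assumptions made for illustration; the abstract itself only states that draws are averaged across machines.

```python
import numpy as np

def shard_posterior_draws(y_shard, n_shards, n_draws, rng, sigma=1.0, tau=10.0):
    """Draw from one shard's posterior for the mean of N(mu, sigma^2) data.

    Assumption: the N(0, tau^2) prior is split evenly across shards, so each
    shard uses the prior raised to 1/n_shards, i.e. N(0, n_shards * tau^2).
    """
    precision = len(y_shard) / sigma**2 + 1.0 / (n_shards * tau**2)
    mean = (y_shard.sum() / sigma**2) / precision
    return rng.normal(mean, np.sqrt(1.0 / precision), size=n_draws)

def consensus_draws(draws):
    """Combine per-shard draws, shape (n_shards, n_draws), draw by draw.

    Assumption: each shard is weighted by the inverse of its sample variance.
    """
    weights = 1.0 / draws.var(axis=1, ddof=1)
    return (weights[:, None] * draws).sum(axis=0) / weights.sum()

rng = np.random.default_rng(0)
n_shards, n_draws = 10, 5000
sigma, tau = 1.0, 10.0

# Simulated data, split across hypothetical "machines".
y = rng.normal(2.0, sigma, size=10_000)
shards = np.array_split(y, n_shards)

# Each shard samples independently; only the draws are communicated.
draws = np.stack(
    [shard_posterior_draws(s, n_shards, n_draws, rng, sigma, tau) for s in shards]
)
combined = consensus_draws(draws)

# Single-machine posterior for comparison (available in closed form here).
full_precision = len(y) / sigma**2 + 1.0 / tau**2
full_mean = (y.sum() / sigma**2) / full_precision
full_sd = np.sqrt(1.0 / full_precision)

print("consensus mean, sd:", combined.mean(), combined.std(ddof=1))
print("full-data mean, sd:", full_mean, full_sd)
```

In this Gaussian setting the combined draws should closely match the single-machine posterior, which is consistent with the abstract's remark that the quality of the approximation depends on the model.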

Bibliographic details

Source
International Journal of Management Science and Engineering Management | 2016, Issue 2 | pp. 78-88 | 11 pages
Authors
Steven L. Scott; Alexander W. Blocker; Fernando V. Bonassi; Hugh A. Chipman; Edward I. George; Robert E. McCulloch
Author affiliations

Google, Inc., Mountain View, USA;

Google, Inc., Mountain View, USA;

Google, Inc., Mountain View, USA;

Acadia University, Wolfville, Nova Scotia, Canada;

The Wharton School, University of Pennsylvania, Philadelphia, USA;

Booth School of Business, University of Chicago, Chicago, USA;

Original format: PDF
Language: English
Keywords
Bayesian inference; Markov chain Monte Carlo; distributed computing; big data; embarrassingly parallel
