首页> 外文会议>IEEE International Conference on Data Engineering >A Gossip-Based System for Fast Approximate Score Computation in Multinomial Bayesian Networks
【24h】

A Gossip-Based System for Fast Approximate Score Computation in Multinomial Bayesian Networks

机译:贝叶斯网络中基于Gossip的快速近似分数计算系统

获取原文

摘要

In this paper, we present a system for fast approximate score computation, a fundamental task for score-based structure learning of multinomial Bayesian networks. Our work is motivated by the fact that exact score computation on large datasets is very time consuming. Our system enables approximate score computation on large datasets in an efficient and scalable manner with probabilistic error bounds on the statistics required for score computation. Our system has several novel features including gossip-based decentralized computation of statistics, lower resource consumption via a probabilistic approach of maintaining statistics, and effective distribution of tasks for score computation using hashing techniques. The demo will provide a real-time and interactive experience to a user on how our system employs the principle of gossiping and hashing techniques in a novel way for fast approximate score computation. The user will be able to control different aspects of our system's execution on a cluster with up to 32 nodes. The approximate scores output by our system can be then used by existing score-based structure learning algorithms.
机译:在本文中,我们提出了一种用于快速近似分数计算的系统,这是多项式贝叶斯网络基于分数的结构学习的一项基本任务。在大型数据集上进行精确得分计算非常耗时,这一事实激发了我们的工作动机。我们的系统能够以高效且可扩展的方式对大型数据集进行近似分数计算,并在分数计算所需的统计数据上具有概率误差范围。我们的系统具有几个新颖的功能,包括基于八卦的分散式统计计算,通过概率统计方法来降低资源消耗,以及使用哈希技术有效分配任务以进行分数计算。该演示将为用户提供实时交互的体验,以了解我们的系统如何以新颖的方式利用闲聊和哈希技术原理进行快速近似分数计算。用户将能够在多达32个节点的集群上控制我们系统执行的不同方面。然后,我们的系统输出的近似分数可以被现有的基于分数的结构学习算法使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号