首页> 外文会议>IEEE International Conference on Data Engineering >A Gossip-Based System for Fast Approximate Score Computation in Multinomial Bayesian Networks
【24h】

A Gossip-Based System for Fast Approximate Score Computation in Multinomial Bayesian Networks

机译:基于GOSSIP的系统,用于多项式贝叶斯网络中的快速近似分数计算

获取原文

摘要

In this paper, we present a system for fast approximate score computation, a fundamental task for score-based structure learning of multinomial Bayesian networks. Our work is motivated by the fact that exact score computation on large datasets is very time consuming. Our system enables approximate score computation on large datasets in an efficient and scalable manner with probabilistic error bounds on the statistics required for score computation. Our system has several novel features including gossip-based decentralized computation of statistics, lower resource consumption via a probabilistic approach of maintaining statistics, and effective distribution of tasks for score computation using hashing techniques. The demo will provide a real-time and interactive experience to a user on how our system employs the principle of gossiping and hashing techniques in a novel way for fast approximate score computation. The user will be able to control different aspects of our system's execution on a cluster with up to 32 nodes. The approximate scores output by our system can be then used by existing score-based structure learning algorithms.
机译:在本文中,我们提出了一种快速近似分数计算系统,这是多项式贝叶斯网络的得分基于结构学习的基本任务。我们的工作是激励的,即大型数据集的确切得分计算非常耗时。我们的系统在大型数据集中能够以有效且可扩展的方式对大型数据集进行近似分数计算,以概率计算得分计算所需的统计信息。我们的系统具有几个新颖的功能,包括基于八卦的分散计算的统计数据,通过维持统计的概率方法,以及使用散列技术有效分配任务的任务分配。演示将为用户提供实时和互动体验,以便我们的系统以新颖的方式采用闲聊和散列技术原则,以便快速近似分数计算。用户将能够控制我们在最多32个节点上的群集中执行系统执行的不同方面。然后,我们的系统输出的近似分数可以由现有的基于分数的结构学习算法使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号