首页> 外文期刊>高分子論文集 >Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms
【24h】

Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms

机译:谁能学习更好的贝叶斯网络结构:结构学习算法的准确性和速度

获取原文
获取原文并翻译 | 示例
           

摘要

Three classes of algorithms to learn the structure of Bayesian networks from data are common in the literature: constraint-based algorithms, which use conditional independence tests to learn the dependence structure of the data; score-based algorithms, which use goodness-of-fit scores as objective functions to maximise; and hybrid algorithms that combine both approaches. Constraint-based and score-based algorithms have been shown to learn the same structures when conditional independence and goodness of fit are both assessed using entropy and the topological ordering of the network is known [1].In this paper, we investigate how these three classes of algorithms perform outside the assumptions above in terms of speed and accuracy of network reconstruction for both discrete and Gaussian Bayesian networks. We approach this question by recognising that structure learning is defined by the combination of a statistical criterion and an algorithm that determines how the criterion is applied to the data. Removing the confounding effect of different choices for the statistical criterion, we find using both simulated and real-world complex data that constraint-based algorithms are often less accurate than score-based algorithms, but are seldom faster (even at large sample sizes); and that hybrid algorithms are neither faster nor more accurate than constraint-based algorithms. This suggests that commonly held beliefs on structure learning in the literature are strongly influenced by the choice of particular statistical criteria rather than just by the properties of the algorithms themselves. (C) 2019 Elsevier Inc. All rights reserved.
机译:从数据中学习贝叶斯网络结构的三类算法很常见:基于约束的算法,它使用条件独立性测试来学习数据的依存结构;基于得分的算法,该算法使用拟合优度得分作为目标函数来最大化;以及结合了两种方法的混合算法。当使用熵评估网络的拓扑独立性和条件独立性和拟合优度时,基于约束和基于分数的算法已被证明可以学习相同的结构[1]。在本文中,我们研究了这三种方法在离散和高斯贝叶斯网络的网络重建速度和准确性方面,各种算法都在上述假设之外执行。我们通过认识到结构学习是由统计标准和确定该标准如何应用于数据的算法的组合来解决这个问题的。除去统计标准的不同选择的混淆影响,我们发现使用模拟和真实世界的复杂数据,基于约束的算法通常不如基于分数的算法准确,但是却很少更快(即使在大样本量下);而且混合算法既不比基于约束的算法更快也不更准确。这表明,文献中关于结构学习的普遍信念受到特定统计标准的选择的强烈影响,而不仅仅是算法本身的属性。 (C)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号