【24h】

An Efficient Causal Discovery Algorithm for Linear Models

机译:线性模型的有效因果发现算法

获取原文

摘要

Bayesian network learning algorithms have been widely used for causal discovery since the pioneer work [13, 18]. Among all existing algorithms, three-phase dependency analysis algorithm (TPDA) [5] is the most efficient one in the sense that it has polynomial-time complexity. However, there are still some limitations to be improved. First, TPDA depends on mutual information-based conditional independence (CI) tests, and so is not easy to be applied to continuous data. In addition, TPDA uses two phases to get approximate skeletons of Bayesian networks, which is not efficient in practice. In this paper, we propose a two-phase algorithm with partial correlation-based CI tests: the first phase of the algorithm constructs a Markov random field from data, which provides a close approximation to the structure of the true Bayesian network; at the second phase, the algorithm removes redundant edges according to CI tests to get the true Bayesian network. We show that two-phase algorithm with partial correlation-based CI tests can deal with continuous data following arbitrary distributions rather than only Gaussian distribution.
机译:自开创性工作以来,贝叶斯网络学习算法已广泛用于因果发现[13,18]。在所有现有算法中,三相依赖性分析算法(TPDA)[5]从多项式时间复杂度的角度来看是最有效的算法。但是,仍有一些限制需要改进。首先,TPDA依赖于基于互信息的条件独立性(CI)测试,因此不容易应用于连续数据。另外,TPDA使用两个阶段来获取贝叶斯网络的近似骨架,这在实践中效率不高。在本文中,我们提出了一种带有基于部分相关性CI检验的两阶段算法:该算法的第一阶段从数据构造一个马尔可夫随机场,这为真实贝叶斯网络的结构提供了一个近似的近似值。在第二阶段,该算法根据CI测试去除冗余边缘,以获得真实的贝叶斯网络。我们表明,具有基于部分相关性CI检验的两阶段算法可以处理遵循任意分布的连续数据,而不仅仅是高斯分布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号