首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >An Efficient Entropy-Based Causal Discovery Method for Linear Structural Equation Models With IID Noise Variables
【24h】

An Efficient Entropy-Based Causal Discovery Method for Linear Structural Equation Models With IID Noise Variables

机译:具有IID噪声变量的线性结构方程模型的基于高效的基于熵的因果解法方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The discovery of causal relationships from the observational data is an important task. To identify the unique causal structure belonging to a Markov equivalence class, a number of algorithms, such as the linear non-Gaussian acyclic model (LiNGAM), have been proposed. However, two challenges remain to be met: 1) these algorithms fail to work on the data which follow linear structural equation model with Gaussian noise and 2) they misjudge the causal direction when the data contain additional measurement errors. In this paper, we propose an entropy-based two-phase iterative algorithm for arbitrary distribution data with additional measurement errors under some mild assumptions. In the first phase of the algorithm, based on the property that entropy can measure the amount of information behind the data with arbitrary distribution, we design a general approach for the identification of exogenous variable on both Gaussian and non-Gaussian data, and we give the corresponding theoretical derivation. In the second phase, to eliminate the effects of measurement errors, we revise the value of the exogenous variable by removing its measurement error and further use the revised value to remove its effect on the remaining variables. Experimental results on real-world causal structures are presented to demonstrate the effectiveness and stability of our method. We also apply the proposed algorithm on the mobile-base-station data with measurement errors, and the results further prove the effectiveness of our algorithm.
机译:从观察数据中发现因果关系是一个重要任务。为了识别属于Markov等效类的独特原因结构,已经提出了许多算法,例如线性非高斯联态模型(Lingam)。然而,仍有可能会满足的两个挑战:1)这些算法无法在遵循具有高斯噪声的线性结构方程模型的数据上工作,并且当数据包含额外的测量误差时,它们会误导因果方向。在本文中,我们提出了一种基于熵的两相迭代算法,用于任意分布数据,其在一些温和的假设下具有额外的测量误差。在算法的第一阶段,基于熵可以测量数据背后的信息量,我们设计了一种识别高斯和非高斯数据的外源变量的一般方法,我们给予相应的理论推导。在第二阶段,为了消除测量误差的影响,我们通过去除其测量误差来修改外源变量的值,并进一步使用修改的值来消除其对剩余变量的影响。提出了实验结果,以证明我们方法的有效性和稳定性。我们还使用测量误差将所提出的算法应用于移动基站数据,结果进一步证明了我们算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号