...
首页> 外文期刊>SIGKDD explorations >Generalized Score Functions for Causal Discovery
【24h】

Generalized Score Functions for Causal Discovery

机译:因果发现的广义得分函数

获取原文
获取原文并翻译 | 示例
           

摘要

Discovery of causal relationships from observational data is a fundamental problem. Roughly speaking, there are two types of methods for causal discovery, constraint-based ones and score-based ones. Score-based methods avoid the multiple testing problem and enjoy certain advantages compared to constraint-based ones. However, most of them need strong assumptions on the functional forms of causal mechanisms, as well as on data distributions, which limit their applicability. In practice the precise information of the underlying model class is usually unknown. If the above assumptions are violated, both spurious and missing edges may result. In this paper, we introduce generalized score functions for causal discovery based on the characterization of general (conditional) independence relationships between random variables, without assuming particular model classes. In particular, we exploit regression in RKHS to capture the dependence in a nonparametric way. The resulting causal discovery approach produces asymptotically correct results in rather general cases, which may have nonlinear causal mechanisms, a wide class of data distributions, mixed continuous and discrete data, and multidimensional variables. Experimental results on both synthetic and real-world data demonstrate the efficacy of our proposed approach.
机译:发现来自观察数据的因果关系是一个基本问题。粗略地说,有两种类型的因果发现,基于约束和基于分数的方法。基于刻度的方法避免了多个测试问题,与基于约束的基础相比享有某些优点。但是,大多数人都需要对因果机制的功能形式的强烈假设,以及限制其适用性的数据分布。在实践中,底层模型类的精确信息通常是未知的。如果违反上述假设,则可能导致虚假和缺失的边缘。在本文中,我们基于随机变量之间的一般(条件)独立关系的表征来介绍因果区的概念发现,而不假设特定的模型类。特别是,我们利用RKH中的回归来捕获非参数的依赖。由此产生的因果发现方法在相当一般情况下产生渐近的正确结果,这可能具有非线性因果机制,广泛的数据分布,混合连续和离散数据以及多维变量。综合性和现实世界数据的实验结果表明了我们提出的方法的功效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号