首页> 外文期刊>Journal of machine learning research >FLAME: A Fast Large-scale Almost Matching Exactly Approach to Causal Inference
【24h】

FLAME: A Fast Large-scale Almost Matching Exactly Approach to Causal Inference

机译:火焰:快速大规模几乎与因果推断完全相同

获取原文
           

摘要

A classical problem in causal inference is that of matching, where treatment units need to be matched to control units based on covariate information. In this work, we propose a method that computes high quality almost-exact matches for high-dimensional categorical datasets. This method, called FLAME (Fast Large-scale Almost Matching Exactly), learns a distance metric for matching using a hold-out training data set. In order to perform matching efficiently for large datasets, FLAME leverages techniques that are natural for query processing in the area of database management, and two implementations of FLAME are provided: the first uses SQL queries and the second uses bit-vector techniques. The algorithm starts by constructing matches of the highest quality (exact matches on all covariates), and successively eliminates variables in order to match exactly on as many variables as possible, while still maintaining interpretable high-quality matches and balance between treatment and control groups. We leverage these high quality matches to estimate conditional average treatment effects (CATEs). Our experiments show that FLAME scales to huge datasets with millions of observations where existing state-of-the-art methods fail, and that it achieves significantly better performance than other matching methods.
机译:因果推断中的经典问题是匹配的,其中需要基于协变量信息与控制单元匹配。在这项工作中,我们提出了一种计算高质量几乎精确匹配的方法对高维分类数据集。这种调用火焰(完全匹配的快速大规模)的方法,了解使用阻止训练数据集匹配的距离度量。为了有效地执行大型数据集的匹配,火焰利用了在数据库管理区域中进行查询处理的自然的技术,并且提供了两种火焰的实现:首先使用SQL查询,第二个使用位矢量技术。该算法通过构建最高质量的匹配(所有协变量上的完全匹配),并且连续消除变量,以便尽可能多地匹配,同时仍保持可解释的高质量匹配和治疗和控制组之间的平衡。我们利用这些高质量的比赛来估计条件平均治疗效果(凯特)。我们的实验表明,具有数百万个观察的火焰缩小到巨大的数据集,其中现有的最先进的方法失败,并且它实现的性能明显多于其他匹配方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号