首页> 外文会议>International workshop on Multi-relational mining >Bias-free hypothesis evaluation in multirelational domains
【24h】

Bias-free hypothesis evaluation in multirelational domains

机译:多层域的无偏见假设评估

获取原文

摘要

In propositional domains, using a separate test set via random sampling or cross validation is generally considered to be an unbiased estimator of true error. In multirelational domains, previous work has already noted that linkage of objects may cause these procedures to be biased, and has proposed corrected sampling procedures. However, as we show in this paper, the existing procedures only address one particular case of bias introduced by linkage. We recall that in the propositional case cross validation measures off-training set (OTS) error and not true error and illustrate the difference with a small experiment. In the multirelational case, we show that the distinction between training and test set needs to be carefully extended based on a graph of potentially linked objects, and on their assumed probabilities of reoccurrence. We demonstrate that the bias due to linkage to known objects varies with the chosen proportion of the training/test split and present an algorithm, generalized subgraph sampling, that is guaranteed to avoid bias in the test set for more generalized cases.
机译:在命题域中,使用随机采样或交叉验证的单独测试集通常被认为是真正误差的无偏估计。在多层主义的域中,先前的工作已经指出,对象的链接可能导致这些程序偏见,并提出了纠正的采样程序。但是,正如我们在本文中所展示的那样,现有程序只能解决连锁引入的一个特定偏差案例。我们记得,在命题案例交叉验证中,禁止训练集(OTS)错误,而不是真正的错误,并说明与小实验的差异。在多层式案例中,我们表明,需要根据潜在的链接对象的图表以及其假设的再次概率来仔细扩展训练和测试集之间的区别。我们证明,由于与已知对象的链接引起的偏差随着训练/测试分裂的选择比例而变化,并且呈现了一种算法,广义子图采样,这是保证避免测试集中的偏差广义案件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号