【24h】

Bias-Free Hypothesis Evaluation in Multirelational Domains

机译:多关系域中的无偏差假设评估

获取原文
获取原文并翻译 | 示例

摘要

In propositional domains using a separate test set via random sampling or cross validation is generally considered to be an unbiased estimator of true error. In multirelational domains previous work has already noted that linkage of objects may cause these procedures to be biased and has proposed corrected sampling procedures. However, as we show in this paper, the existing procedures only address one particular case of bias introduced by linkage. In this paper we therefore introduce generalized subgraph sampling, a sampling procedure based on bin packing, which ensures that test sets are properly chosen to match the probability of reencountering previously seen objects and which includes previous approaches as a special case. Experiments with data from the Internet Movie Database illustrate the performance of our algorithm.
机译:在命题域中,通过随机抽样或交叉验证使用单独的测试集通常被认为是真实误差的无偏估计量。在多关系域中,先前的工作已经指出,对象的链接可能会导致这些过程产生偏差,并提出了更正后的采样过程。但是,正如我们在本文中所显示的那样,现有程序仅解决了链接引起的一种特殊情况。因此,在本文中,我们介绍了广义子图采样,这是一种基于bin装箱的采样过程,可确保正确选择测试集以匹配重新遇到先前看到的对象的概率,并包括以前的方法作为特殊情况。对来自Internet电影数据库的数据进行的实验说明了我们算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号