...
首页> 外文期刊>IEICE transactions on information and systems >Mining Approximate Primary Functional Dependency on Web Tables
【24h】

Mining Approximate Primary Functional Dependency on Web Tables

机译:在Web表上挖掘近似的主要功能依赖性

获取原文
           

摘要

We propose to discover approximate primary functional dependency (aPFD) for web tables, which focus on the determination relationship between primary attributes and non-primary attributes and are more helpful for entity column detection and topic discovery on web tables. Based on association rules and information theory, we propose metrics Conf and InfoGain to evaluate PFDs. By quantifying PFDs' strength and designing pruning strategies to eliminate false positives, our method could select minimal non-trivial approximate PFD effectively and are scalable to large tables. The comprehensive experimental results on real web datasets show that our method significantly outperforms previous work in both effectiveness and efficiency.
机译:我们建议发现Web表格的近似主要功能依赖关系(aPFD),该功能着重于主要属性和非主要属性之间的确定关系,并且对于实体列检测和Web表格上的主题发现更有用。基于关联规则和信息理论,我们提出度量 Conf和 InfoGain以评估PFD。通过量化PFD的强度并设计修剪策略以消除误报,我们的方法可以有效地选择最小的非平凡近似PFD,并且可以扩展到大表。在真实Web数据集上的综合实验结果表明,我们的方法在有效性和效率上都大大优于以前的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号