...
首页> 外文期刊>Journal of information and computational science >Blocking Records Based on Triangle-Inequality
【24h】

Blocking Records Based on Triangle-Inequality

机译:基于三角不等式阻止记录

获取原文
获取原文并翻译 | 示例

摘要

Deduplication and automatize the matching process is one of the most important research branch of database field. The paper proposes a novel method working in metrics space. Based on the triangle inequality theorem, the method blocks records into different, sets. Paper further proposes a multiple iterative mechanism and gives corresponding algorithms. The paper analyzes the complexity of the method and experimentally verifies the efficiency of the method. Compared with traditional methods, it improves the accuracy, precision and recall.
机译:重复数据删除和匹配过程的自动化是数据库领域最重要的研究分支之一。本文提出了一种在度量空间中工作的新方法。该方法基于三角不等式定理,将记录分为不同的集合。论文进一步提出了一种多重迭代机制,并给出了相应的算法。本文分析了该方法的复杂性,并通过实验验证了该方法的有效性。与传统方法相比,它提高了准确性,准确性和查全率。

著录项

  • 来源
    《Journal of information and computational science 》 |2010年第11期| p.2224-2231| 共8页
  • 作者单位

    College of Information System and Management. National University of Defense Technology Changsha 410073. China;

    College of Information System and Management. National University of Defense Technology Changsha 410073. China;

    College of Information System and Management. National University of Defense Technology Changsha 410073. China;

    College of Information System and Management. National University of Defense Technology Changsha 410073. China;

    College of Information System and Management. National University of Defense Technology Changsha 410073. China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    record matching; blocking: metric; similarity;

    机译:记录匹配;封锁:指标;相似;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号