首页> 外文会议>E-Business and Information System Security, 2009. EBISS '09 >Research of Duplicate Record Cleaning Technology Based on a Reformative Keywords Matching Algorithm
【24h】

Research of Duplicate Record Cleaning Technology Based on a Reformative Keywords Matching Algorithm

机译:基于改进关键字匹配算法的重复记录清理技术研究

获取原文

摘要

Based on the analysis of the insufficiencies of the present Chinese matching algorithms, by examining the characteristics of approximately duplicate records, this paper proposes a method of duplicate record cleaning based on a reformative keywords matching algorithm. Experiments show that this method improves Recall and Precision of duplicate record evidently.
机译:在分析现有中文匹配算法不足的基础上,通过研究近似重复记录的特点,提出了一种基于改进关键词匹配算法的重复记录清除方法。实验表明,该方法明显提高了重复记录的查全率和查准率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号