Conference: International Conference on Advanced Design and Manufacturing Engineering

Detection for Approximately Duplicate Records Based on Fuzzy Comprehensive Evaluation

Abstract

To address the problem of determining attribute weights when detecting approximately duplicate records, we propose a method based on fuzzy comprehensive evaluation for obtaining the attribute weights in a data set. We first analyze the factors that make up each attribute and then evaluate their ranks. Finally, we determine the attribute weights using the fuzzy comprehensive evaluation method and detect approximately duplicate records on that basis. Theoretical analysis and experimental results show that the method can determine all attribute weights objectively and can effectively detect approximately duplicate records in massive data sets.
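
The abstract does not give the paper's formulas, so the following is only a minimal sketch of how a pipeline of this kind might look, not the authors' implementation. It assumes a weighted-average fuzzy comprehensive evaluation (factor weights combined with a membership matrix, then defuzzified with grade values) and a weighted sum of per-field string similarities for the duplicate test. The factor weights, grade values, membership numbers, field names, and the 0.85 threshold are all hypothetical and chosen purely for illustration.

```python
from difflib import SequenceMatcher


def fuzzy_score(factor_weights, membership, grade_values):
    """Weighted-average fuzzy comprehensive evaluation for one attribute.

    membership[i][j] is the degree to which factor i belongs to grade j
    (hypothetical values; the source does not specify them).
    """
    # B = A . R: combine the factor weights with the membership matrix.
    b = [
        sum(w * row[j] for w, row in zip(factor_weights, membership))
        for j in range(len(grade_values))
    ]
    total = sum(b)
    b = [x / total for x in b]  # normalise the evaluation vector
    # Defuzzify: collapse the grade distribution into a single score.
    return sum(x * v for x, v in zip(b, grade_values))


def attribute_weights(factor_weights, memberships, grade_values):
    """Normalise per-attribute scores so that the weights sum to 1."""
    scores = {
        name: fuzzy_score(factor_weights, r, grade_values)
        for name, r in memberships.items()
    }
    total = sum(scores.values())
    return {name: s / total for name, s in scores.items()}


def record_similarity(a, b, weights):
    """Weighted sum of per-field string similarities (assumed measure)."""
    return sum(
        w * SequenceMatcher(None, a[k], b[k]).ratio()
        for k, w in weights.items()
    )


def is_approx_duplicate(a, b, weights, threshold=0.85):
    """Flag a pair as approximate duplicates above an assumed threshold."""
    return record_similarity(a, b, weights) >= threshold


# Hypothetical setup: two factors, three grades (high, medium, low).
FACTOR_WEIGHTS = [0.6, 0.4]
GRADE_VALUES = [1.0, 0.6, 0.2]
MEMBERSHIPS = {
    "name": [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1]],
    "city": [[0.1, 0.3, 0.6], [0.5, 0.3, 0.2]],
}

weights = attribute_weights(FACTOR_WEIGHTS, MEMBERSHIPS, GRADE_VALUES)
r1 = {"name": "John Smith", "city": "Boston"}
r2 = {"name": "Jon Smith", "city": "Boston"}
print(weights, is_approx_duplicate(r1, r2, weights))
```

In this sketch an attribute whose factors are rated mostly "high" receives a larger weight, so a mismatch in that field lowers the record similarity more than a mismatch in a low-rated field. Comparing every pair of records is quadratic, so a massive data set would also need some blocking or sorting step, which the abstract does not describe.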