首页> 外文会议>International Conference on Network and System Security >An Efficient and Effective Duplication Detection Method in Large Database Applications
【24h】

An Efficient and Effective Duplication Detection Method in Large Database Applications

机译:大型数据库应用中有效且有效的复制检测方法

获取原文

摘要

In this paper, we developed a robust data cleaning technique, called PC-Filter+ (PC stands for partition comparison) based on its predecessor, for effective and efficient duplicate record detection in large databases. PC-Filter+ provides more flexible algorithmic options for constructing the Partition Comparison Graph (PCG). In addition, PC-Filter+ is able to deal with duplicate detection under different memory constraints.
机译:在本文中,我们开发了一种强大的数据清理技术,称为PC-Filter +(PC代表分区比较),基于其前述者,在大型数据库中有效和有效的重复记录检测。 PC-Filter +提供更灵活的算法选项,用于构建分区比较图(PCG)。此外,PC-Filter +能够在不同的内存约束下处理重复检测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号