首页> 外文期刊>Advances in computational sciences and technology >A Comprehensive Review of Significant Researches on Duplicate Record Detection in Databases
【24h】

A Comprehensive Review of Significant Researches on Duplicate Record Detection in Databases

机译:数据库中重复记录检测的重要研究综述

获取原文
获取原文并翻译 | 示例
       

摘要

The abundant amount of data produced and the requirement to merge data from more than one source had resulted in a challenging issue of the efficient detection of duplicate records in databases. Entities possess two or more denotations in real world databases. Generally, duplicate records comprise of errors and are devoid of a common shared key thereby making the task of duplicate matching tedious. A wide variety of methodologies for the identification of duplicate records were projected by numerous researchers. A comprehensive review of the duplicate record detection techniques from significant research works is presented in this paper. An extensive review of the existing literature in duplicate detection of general records in large databases is presented along with the classification. Additionally, a brief introduction about duplicate records detection is presented as well.
机译:产生的大量数据以及合并来自多个来源的数据的要求导致了一个挑战性问题,即如何有效检测数据库中的重复记录。实体在现实世界数据库中具有两个或多个符号。通常,重复记录包含错误并且没有公共的共享密钥,从而使重复匹配的任务繁琐。许多研究人员提出了多种识别重复记录的方法。本文对来自重要研究工作的重复记录检测技术进行了全面回顾。与分类一起,对大型数据库中一般记录的重复检测中的现有文献进行了全面回顾。此外,还介绍了有关重复记录检测的简短介绍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号