首页> 外文会议>International conference on information engineering and applications >Detecting Approximately Duplicate Records in Database
【24h】

Detecting Approximately Duplicate Records in Database

机译:在数据库中检测大约重复的记录

获取原文

摘要

The existing database system data quantity is huge, many of which are repeated data. Using the traditional approach for detecting approximately duplicate records to find similar duplicate records in the database will involve very large time complexity and space complexity, unable to obtain very good results. This chapter presents a method based on improved genetic neural network approach for detecting approximately duplicate records, using genetic algorithm to optimize the network's initial weights; and then using the BP algorithm to train the detection data to obtain network model. The experimental results show that this method can effectively solve the huge amount of approximately duplicate record data detection problem.
机译:现有的数据库系统数据量巨大,其中许多是重复数据。使用传统的方法来检测近似重复的记录以在数据库中找到相似的重复记录将涉及非常大的时间复杂度和空间复杂度,无法获得非常好的结果。本章提出了一种基于改进遗传神经网络的方法,用于检测近似重复的记录,并使用遗传算法来优化网络的初始权重。然后使用BP算法训练检测数据以获得网络模型。实验结果表明,该方法可以有效地解决大量重复记录数据的检测问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号