首页> 外文期刊>International Journal of Computers & Applications >INCREMENTAL OBJECT MATCHING APPROACH OF SCHEMA-FREE DATA WITH MAPREDUCE
【24h】

INCREMENTAL OBJECT MATCHING APPROACH OF SCHEMA-FREE DATA WITH MAPREDUCE

机译:带有映射还原的无模式数据的增量对象匹配方法

获取原文
获取原文并翻译 | 示例
       

摘要

Object matching is a traditional deduplication approach that is used to identify the duplicates within one or several data sets respectively. However, this task will become difficult in the context of big data. Few publications have mentioned incremental methods to solve this legacy issue in parallel. To address this limitation, we aim to propose an incremental object matching approach (IOMMapReduce) in parallel. We investigate possible solutions to improve current object matching approaches with MapReduce, to make it support incrementality to speed up the deduplication process. Finally, our experimental evaluation on large data sets shows the high effectiveness and efficiency of the proposed approaches.
机译:对象匹配是一种传统的重复数据删除方法,用于分别标识一个或几个数据集中的重复数据。但是,在大数据的情况下,此任务将变得困难。很少有出版物提到增量方法来并行解决此遗留问题。为了解决此限制,我们旨在提出一种并行的增量对象匹配方法(IOMMapReduce)。我们研究了可能的解决方案,以使用MapReduce改进当前的对象匹配方法,使其支持增量性以加快重复数据删除过程。最后,我们对大型数据集的实验评估表明了所提出方法的高效性和有效性。

著录项

  • 来源
  • 作者

    Kun Ma; Fusen Dong; Bo Yang;

  • 作者单位

    Shandong Provincial Key Laboratory of Network Based Intelligent Computing, University of Jinan, China;

    Shandong Provincial Key Laboratory of Network Based Intelligent Computing, University of Jinan, China;

    Shandong vincial Key Laboratory of Network Based Intelligent Computing, University of Jinan, China;

  • 收录信息 美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Object matching; MapReduce; sorted neighbourhood; big data; NoSQL;

    机译:对象匹配;MapReduce;分类的街区;大数据;NoSQL;
  • 入库时间 2022-08-18 00:39:04

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号