Journal: Knowledge and Data Engineering, IEEE Transactions on

i²MapReduce: Incremental MapReduce for Mining Evolving Big Data


Abstract

As new data and updates are constantly arriving, the results of data mining applications become stale and obsolete over time. Incremental processing is a promising approach to refreshing mining results. It utilizes previously saved states to avoid the expense of re-computation from scratch. In this paper, we propose i²MapReduce, a novel incremental processing extension to MapReduce, the most widely used framework for mining big data. Compared with the state-of-the-art work on Incoop, i²MapReduce (i) performs key-value pair level incremental processing rather than task level re-computation, (ii) supports not only one-step computation but also more sophisticated iterative computation, which is widely used in data mining applications, and (iii) incorporates a set of novel techniques to reduce I/O overhead for accessing preserved fine-grain computation states. We evaluate i²MapReduce using a one-step algorithm and four iterative algorithms with diverse computation characteristics. Experimental results on Amazon EC2 show significant performance improvements of i²MapReduce compared to both plain and iterative MapReduce performing re-computation.
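The idea of key-value pair level incremental processing for a one-step job can be illustrated with a small in-memory sketch. This is not the paper's implementation: the class `IncrementalJob`, its method `apply_delta`, and the word-count `map_fn`/`reduce_fn` are hypothetical names chosen for illustration, and the preserved state is held in Python dictionaries rather than in the on-disk structures the abstract alludes to. The sketch only shows the general principle: keep each input record's map output, and when a delta arrives, re-run map on the changed records and re-run reduce only for the keys they touch.

```python
from collections import defaultdict


def map_fn(record_id, text):
    """Word-count style map: emit (word, 1) for every word in the record."""
    return [(word, 1) for word in text.split()]


def reduce_fn(key, values):
    """Word-count style reduce: sum the counts for one key."""
    return sum(values)


class IncrementalJob:
    """Hypothetical sketch: preserves per-record map outputs so that a delta
    only re-reduces the keys it affects (not the paper's actual design)."""

    def __init__(self):
        self.map_outputs = {}            # record_id -> [(key, value), ...]
        self.by_key = defaultdict(dict)  # key -> {record_id: [values]}
        self.results = {}                # key -> reduced value

    def _remove(self, record_id):
        """Drop a record's preserved map output; return the keys it touched."""
        affected = set()
        for key, _ in self.map_outputs.pop(record_id, []):
            self.by_key[key].pop(record_id, None)
            affected.add(key)
        return affected

    def _add(self, record_id, text):
        """Run map on one record and preserve its output."""
        affected = set()
        pairs = map_fn(record_id, text)
        self.map_outputs[record_id] = pairs
        for key, value in pairs:
            self.by_key[key].setdefault(record_id, []).append(value)
            affected.add(key)
        return affected

    def apply_delta(self, upserts=None, deletes=None):
        """Apply inserted/updated and deleted records, then re-reduce
        only the affected keys. Returns the set of affected keys."""
        affected = set()
        for record_id in (deletes or []):
            affected |= self._remove(record_id)
        for record_id, text in (upserts or {}).items():
            affected |= self._remove(record_id)  # an update replaces old output
            affected |= self._add(record_id, text)
        for key in affected:
            values = [v for vals in self.by_key[key].values() for v in vals]
            if values:
                self.results[key] = reduce_fn(key, values)
            else:
                self.results.pop(key, None)
                del self.by_key[key]
        return affected


if __name__ == "__main__":
    job = IncrementalJob()
    job.apply_delta(upserts={1: "a b a", 2: "b c"})  # initial run
    touched = job.apply_delta(upserts={2: "c d"})    # refresh with a delta
    print(sorted(touched), job.results)
```

In this sketch the second call re-reduces only the keys produced by the old and new versions of record 2, while the result for the key `a` is reused untouched. A task-level scheme would instead re-run the whole map task containing record 2.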
