首页> 外文期刊>International Journal of Computer Science and Engineering >Big Data Mining For Interesting Pattern Using MapReduced Technique
【24h】

Big Data Mining For Interesting Pattern Using MapReduced Technique

机译:使用MapReduce技术的有趣模式的大数据挖掘

获取原文
           

摘要

In the past few years, huge data are need to be stored, access and retrieved, that has increased drastically all over the world, this fast growth of data results in the need to analyse the huge amount of data. Due to lack of proper tools and programs, data remains unused and unutilized with important useful knowledge hidden. This study has carryout data mining interesting patterns in big data. Objectoriented design methodology was used. Frequent pattern growth algorithm on Hadoop using MapReduce has been used and particularly applied it to analyze maximum flight time in flight transaction data store of 108MB. MapReduce program consists of two functions Mapper and Reducer which runs on all machines in a Hadoop cluster. System was implemented in matlab. Computation has been performed to analyzed the actual flight time using user constraints, the constraints are arrival delay and actual elapse time. Airpeace carrier has the longest flight time, the analyzed carrier (Air peace) space was 20000x6 contained 712316 bytes. Thus, the execution time of the entire mining process was 1615 milliseconds.
机译:在过去的几年里,需要存储,访问和检索巨大数据,这增加了世界各地的,这种快速增长的数据导致需要分析大量数据。由于缺乏适当的工具和程序,数据仍然未被利用并且未经利用隐藏的重要知识。本研究在大数据中具有携带数据挖掘有趣模式。使用了非对象的设计方法。使用MapReduce的Hadoop频繁模式增长算法已被使用,特别适用于分析108MB的飞行交易数据存储中的最大飞行时间。 MapReduce程序由两个函数映射器和减速器组成,可在Hadoop集群中的所有计算机上运行。系统在Matlab中实施。已经执行计算以分析使用用户约束的实际飞行时间,约束是到达延迟和实际经验时间。 Airpecece载体的飞行时间最长,分析的承运人(Air和Peach)空间为20000x6载有712316字节。因此,整个采矿过程的执行时间为1615毫秒。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号