首页> 外文会议>IEEE International Conference on Data Mining Workshops >A Parallel Implementation of Idea Graph to Extract Rare Chances from Big Data
【24h】

A Parallel Implementation of Idea Graph to Extract Rare Chances from Big Data

机译:从大数据中提取稀有机会的想法图的并行实现

获取原文
获取外文期刊封面目录资料

摘要

In current days, data tend to become much bigger than before, and the distributed computing system is an prevalent option to deal with them. As one of powerful tools, MapReduce framework provides a cheap and efficient way to write parallel programs to run on distributed computing systems. Chance discovery (CD) is an extension of data mining, where chance refers to rare but important events or situations. Idea Graph is an efficient algorithm proposed to detect chances. However, the traditional implementation of Idea Graph is sequential, and its performance encounters some bottlenecks when dealing with big data. In this paper, we propose a parallel implementation of Idea Graph using MapReduce to better meet with the challenge of big data. First, we introduce the MapReduce framework, and then Idea Graph is introduced in brief. After that, we present the details on how we design the parallel Idea Graph implementation. In the end of the paper, several experiments are conducted to evaluate the proposed implementation. The experimental results demonstrate the validation of the proposed implementation and its better performance as compared with that of sequential Idea Graph implementation when handling big data.
机译:如今,数据趋向于变得比以前大得多,而分布式计算系统是处理这些数据的普遍选择。作为功​​能强大的工具之一,MapReduce框架提供了一种便宜而有效的方式来编写可在分布式计算系统上运行的并行程序。机会发现(CD)是数据挖掘的扩展,机会是指罕见但重要的事件或情况。 Idea Graph是提出用于检测机会的有效算法。但是,Idea Graph的传统实现是顺序的,并且在处理大数据时其性能会遇到一些瓶颈。在本文中,我们提出了使用MapReduce的Idea Graph的并行实现,以更好地应对大数据的挑战。首先,我们介绍MapReduce框架,然后简要介绍Idea Graph。之后,我们将详细介绍如何设计并行的Idea Graph实现。在本文的最后,进行了一些实验以评估所提出的实施方案。实验结果表明,与处理大数据时相继的Idea Graph实现相比,该实现的有效性得到了验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号