首页> 外文期刊>Journal of Parallel and Distributed Computing >IoTDeM: An IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds
【24h】

IoTDeM: An IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds

机译:IoTDeM:面向IoT大数据的MapReduce性能预测扩展模型在多个边缘云中

获取原文
获取原文并翻译 | 示例
           

摘要

Uploading all loT Big Data to a centralized cloud for data analytics is infeasible because of the excessive latency and bandwidth limitation of the Internet. A promising approach to addressing the challenges for data analytics in loT is "edge cloud" that pushes various computing and data analysis capabilities to multiple edge clouds. MapReduce provides an efficient way to deal with a large amount of data. When performing data analysis, a challenge is to predict the performance of MapReduce jobs. In this paper, we propose and evaluate IoTDeM, which is an extended IoT Big Data-oriented model for predicting MapReduce performance m multiple edge clouds. IoTDeM is able to predict MapReduce jobs' total execution time in a general implementation scenario with varying reduce amounts and cluster scales in Hadoop 2, rather than Hadoop 1. The extended model is based on historical job execution records and Locally Weighted Linear Regression (LWLR) techniques to predict the execution time of each job. Through extracting more representative features to represent a job, the IoTDeM model selects a cluster scale as a crucial parameter to further extend LWLR model. In the environment of Hadoop 2 with Ceph as the storage system, the experiments verify IoTDeM can effectively predict the total execution time of MapReduce applications, with the average relative error of less than 10%. (C) 2017 Elsevier Inc. All rights reserved.
机译:由于Internet的过多延迟和带宽限制,将所有大数据上传到集中式云进行数据分析是不可行的。解决边缘数据挑战的一种有前途的方法是“边缘云”,它将各种计算和数据分析功能推向多个边缘云。 MapReduce提供了一种有效的方式来处理大量数据。执行数据分析时,面临的挑战是预测MapReduce作业的性能。在本文中,我们提出并评估了IoTDeM,这是一个扩展的面向IoT大数据的模型,用于预测多个边缘云中的MapReduce性能。 IoTDeM能够在Hadoop 2(而不是Hadoop 1)中以不同的减少数量和集群规模来预测在一般实施方案中MapReduce作业的总执行时间。扩展模型基于历史作业执行记录和本地加权线性回归(LWLR)预测每个作业的执行时间的技术。通过提取更多具有代表性的特征来代表工作,IoTDeM模型选择了集群规模作为进一步扩展LWLR模型的关键参数。在以Ceph为存储系统的Hadoop 2环境中,实验证明IoTDeM可以有效预测MapReduce应用程序的总执行时间,平均相对误差小于10%。 (C)2017 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号