...
首页> 外文期刊>Knowledge and information systems >Heuristic attribute reduction and resource-saving algorithm for energy data of data centers
【24h】

Heuristic attribute reduction and resource-saving algorithm for energy data of data centers

机译:数据中心能量数据的启发式属性减少和资源节约算法

获取原文
获取原文并翻译 | 示例
           

摘要

Energy data, which consist of energy consumption statistics and other related data in green data centers, grow dramatically. The energy data have great value, but many attributes within them are redundant and unnecessary, and they have a serious impact on the performance of the data center's decision-making system. Thus, attribute reduction for the energy data has been conceived as a critical step. However, many existing attribute reduction algorithms are often computationally time-consuming. To address these issues, firstly, we extend the methodology of rough sets to construct data center energy consumption knowledge representation system. Energy data will occur some degree of exceptions caused by power failure, energy instability or other factors; hence, we design an integrated data preprocessing method using Spark for energy data, which mainly includes sampling analysis, data classification, missing data filling, outlier data prediction and data discretization. By taking good advantage of in-memory computing, a fast heuristic attribute reduction algorithm (FHARA-S) for energy data using Spark is proposed. In this algorithm, we use an efficient algorithm for transforming energy consumption decision table, a heuristic formula for measuring the significance of attribute to reduce the search space, and introduce the correlation between condition attribute and decision attribute, which further improve the computational efficiency. We also design an adaptive decision management architecture for the green data center based on FHARA-S, which can improve decision-making efficiency and strengthen energy management. The experimental results show the speed of our algorithm gains up to 2.2X performance improvement over the traditional attribute reduction algorithm using MapReduce and 0.61X performance improvement over the algorithm using Spark. Besides, our algorithm also saves more computational resources.
机译:由绿色数据中心的能量消耗统计数据和其他相关数据组成的能量数据大幅增长。能量数据具有很大的价值,但它们内部的许多属性是多余的,不必要的,并且它们对数据中心的决策系统的性能产生严重影响。因此,已经构思了能量数据的属性降低作为关键步骤。然而,许多现有的属性减少算法通常是计算耗时的。为了解决这些问题,首先,我们扩展了粗糙集的方法来构建数据中心能量消耗知识表示系统。能量数据将发生由电源故障,能量不稳定或其他因素引起的一定程度的例外;因此,我们设计了一种使用火花的集成数据预处理方法,该方法主要包括采样分析,数据分类,缺少数据填充,异常数据预测和数据离散化。通过对存储器计算的良好优势,提出了一种使用火花的能量数据的快速启发式属性还原算法(FHARA-S)。在该算法中,我们使用高效算法来转换能量消耗决策表,启发式公式,用于测量属性减少搜索空间的重要性,并引入条件属性和决策属性之间的相关性,从而进一步提高了计算效率。我们还基于FHARA-S设计了绿色数据中心的自适应决策管理架构,可以提高决策效率和加强能源管理。实验结果表明,使用MapReduce和0.61X性能改进的传统属性缩减算法,我们的算法增益高达2.2倍的性能改进。此外,我们的算法还节省了更多的计算资源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号