Journal: Procedia Computer Science

DupM: a Data Replica Allocation Strategy for Distributed Mining


Abstract

Algorithms for mining massive data sets are usually sensitive to computing resources: the quantities available (e.g., the memory and CPUs available to a mining algorithm) determine the mining results. This paper incorporates data replicas as a resource into the resource scheduling strategy for distributed data mining, yielding a replica-aware resource scheduling strategy called DupM. Treating data replicas as a type of data resource, DupM schedules replicas through dynamic programming so that a cloud platform can allocate replicas according to the requirements of distributed mining. Simulation tests on data sets from KDD Cup events and on IBM synthesizer data sets showed that the DupM resource scheduling strategy has advantages over Hadoop's built-in resource scheduling strategy.
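The abstract does not spell out DupM's dynamic program, so the following is only a minimal illustrative sketch of how replica allocation can be posed as a dynamic-programming problem, not the authors' formulation. It assumes a hypothetical table `benefit[i][r]` giving the estimated gain from keeping `r` replicas of data block `i`, and a hypothetical total replica `budget`; both names and the objective are assumptions made for illustration.

```python
from typing import List, Tuple


def allocate_replicas(benefit: List[List[float]], budget: int) -> Tuple[float, List[int]]:
    """Choose a replica count per block to maximise total benefit.

    benefit[i][r] is a hypothetical estimate of the gain from giving
    block i exactly r replicas (index 0 must be present).
    The sum of chosen replica counts never exceeds `budget`.
    """
    n = len(benefit)
    neg_inf = float("-inf")
    # best[i][b]: maximum benefit using the first i blocks and at most b replicas.
    best = [[0.0] * (budget + 1)] + [[neg_inf] * (budget + 1) for _ in range(n)]
    # choice[i][b]: replicas given to block i in the optimum for budget b.
    choice = [[0] * (budget + 1) for _ in range(n)]

    for i in range(1, n + 1):
        max_r = len(benefit[i - 1]) - 1
        for b in range(budget + 1):
            for r in range(min(b, max_r) + 1):
                candidate = best[i - 1][b - r] + benefit[i - 1][r]
                if candidate > best[i][b]:
                    best[i][b] = candidate
                    choice[i - 1][b] = r

    # Walk back through the choice table to recover the allocation.
    counts = [0] * n
    remaining = budget
    for i in range(n - 1, -1, -1):
        counts[i] = choice[i][remaining]
        remaining -= counts[i]
    return best[n][budget], counts


if __name__ == "__main__":
    # Three blocks, up to two replicas each, three replicas in total (made-up numbers).
    example = [[0, 5, 6], [0, 4, 8], [0, 3, 3.5]]
    print(allocate_replicas(example, budget=3))
```

In this toy setting the table is filled block by block, so the cost grows with the number of blocks times the budget times the maximum replica count per block. A real scheduler would also need to account for node placement and the mining job's memory and CPU requirements, which this sketch leaves out.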
