首页> 外文会议>IEEE/ACM international symposium on cluster, cloud and grid computing >Cost-Efficient High-Performance Internet-Scale Data Analytics over Multi-cloud Environments
【24h】

Cost-Efficient High-Performance Internet-Scale Data Analytics over Multi-cloud Environments

机译:多云环境下经济高效的高性能互联网规模数据分析

获取原文

摘要

To analyze data distributed across the world, one can use distributed computing power to take advantage of data locality and achieve higher throughput. The multi-cloud model, a composition of multiple clouds, can provide cost-effective computing resources to process such distributed data. As multicolour becomes more and more accessible from cloud users, the use of MapReduce/Hadoop over multi-cloud is emerging, however, existing work has two issues in principle. First, it mainly focuses on maximizing throughput by improving data locality, but the perspective of cost optimization is missing. Second, conventional centralized optimization methods would not be able to scale well in multi-cloud environments due to its highly dynamic nature. We plan to solve the first issue by formalizing an optimization framework for MapReduce over multi-cloud including virtual machine and data transfer costs, and then the second issue by creating decentralized resource management middleware that considers multi-criteria (cost and performance) optimization. This paper reports progress we have made so far on these two directions.
机译:为了分析分布在世界各地的数据,可以使用分布式计算能力来利用数据局部性并实现更高的吞吐量。多云模型(由多个云组成)可以提供经济高效的计算资源来处理此类分布式数据。随着云用户越来越可以访问多色,在多云上使用MapReduce / Hadoop的趋势正在出现,但是,现有工作原则上存在两个问题。首先,它主要致力于通过改善数据局部性来最大化吞吐量,但是缺少成本优化的观点。其次,由于传统的集中式优化方法具有高度动态性,因此在多云环境中无法很好地扩展。我们计划通过形式化针对多云的MapReduce优化框架(包括虚拟机和数据传输成本)来解决第一个问题,然后通过创建考虑多标准(成本和性能)优化的分散式资源管理中间件来解决第二个问题。本文报告了迄今为止我们在这两个方向上所取得的进展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号