Unicorn: Unified resource orchestration for multi-domain, geo-distributed data analytics

Abstract

As data volumes grow exponentially, data-intensive analytics benefits substantially from multi-organizational, geographically distributed, collaborative computing, in which different organizations contribute diverse but scarce resources (e.g., computation, storage, and networking) to jointly collect, share, and analyze extremely large amounts of data. By analyzing the data analytics trace from the Compact Muon Solenoid (CMS) experiment, one of the largest scientific experiments in the world, and by systematically examining the design of existing resource management systems for clusters, we show two things. First, the multi-domain, geo-distributed, resource-disaggregated nature of this new paradigm calls for a framework that manages a large set of distributively owned, heterogeneous resources, with the objective of efficient resource utilization while respecting the autonomy and privacy of each domain. Second, the fundamental challenge in designing such a framework is how to accurately discover and represent the resource availability of a large set of distributively owned, heterogeneous resources across different domains with minimal information exposure from each domain. Existing resource management systems are designed for single-domain clusters and cannot address this challenge. In this paper, we design Unicorn, the first unified resource orchestration framework for multi-domain, geo-distributed data analytics. In Unicorn, we encode the resource availability of each domain into a resource state abstraction, a variant of the network view abstraction that is extended to accurately represent the availability of multiple resources, with minimal information exposure, using a set of linear inequalities. We then design a novel, efficient cross-domain query algorithm and a privacy-preserving resource information integration protocol to discover and integrate accurate, minimal resource availability information for a set of data analytics jobs across different domains. In addition, Unicorn contains a global resource orchestrator that computes optimal resource allocation decisions for data analytics jobs. We implement a prototype of Unicorn and present preliminary evaluation results to demonstrate its efficiency and efficacy. We also demonstrated the full Unicorn system at SuperComputing 2017. (C) 2018 Elsevier B.V. All rights reserved.
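
To make the idea of describing resource availability with a set of linear inequalities more concrete, the following is a minimal Python sketch. It is not Unicorn's algorithm or API: the names `Inequality`, `feasible`, and `max_uniform_rate`, the two domains, the job names, and all numbers are hypothetical, and the uniform-rate calculation is a simple stand-in for the optimal allocation that the paper's orchestrator computes. It assumes non-negative coefficients and non-negative allocations.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Inequality:
    """One linear constraint: sum(coeffs[job] * alloc[job]) <= bound."""
    coeffs: Dict[str, float]
    bound: float

    def holds(self, alloc: Dict[str, float], tol: float = 1e-9) -> bool:
        # Jobs missing from the allocation are treated as receiving 0.
        total = sum(c * alloc.get(job, 0.0) for job, c in self.coeffs.items())
        return total <= self.bound + tol


def feasible(alloc: Dict[str, float],
             views: Dict[str, List[Inequality]]) -> bool:
    """True if the allocation satisfies every inequality of every domain."""
    return all(ineq.holds(alloc) for view in views.values() for ineq in view)


def max_uniform_rate(jobs: List[str],
                     views: Dict[str, List[Inequality]]) -> float:
    """Largest r such that giving every job rate r satisfies all inequalities.

    Assumes non-negative coefficients, so each inequality caps r at
    bound / (sum of its coefficients over the given jobs).
    """
    best = float("inf")
    for view in views.values():
        for ineq in view:
            weight = sum(ineq.coeffs.get(job, 0.0) for job in jobs)
            if weight > 0:
                best = min(best, ineq.bound / weight)
    return best


# Hypothetical example: each domain exposes only inequalities over job
# rates, not its internal topology or per-device capacities.
views = {
    "domain_a": [Inequality({"job1": 1.0, "job2": 1.0}, 100.0)],  # shared limit
    "domain_b": [Inequality({"job1": 1.0}, 40.0),
                 Inequality({"job2": 1.0}, 80.0)],
}

print(feasible({"job1": 40.0, "job2": 60.0}, views))  # True
print(feasible({"job1": 50.0, "job2": 50.0}, views))  # False: job1 exceeds 40
print(max_uniform_rate(["job1", "job2"], views))      # 40.0
```

In this toy setting an orchestrator only needs the union of the domains' inequalities to test or search for allocations, which is the sense in which such a representation can limit what each domain reveals.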

Bibliographic details

  • Source
    Future Generation Computer Systems | 2019, Issue 4 | pp. 188-197 | 10 pages
  • Author affiliations

    Tongji Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China|Yale Univ, Dept Comp Sci, 51 Prospect St, New Haven, CT 06520 USA;

    Tongji Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China;

    Tongji Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China;

    CALTECH, Phys, Pasadena, CA 91125 USA;

    Tongji Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China|Yale Univ, Comp Sci & Elect Engn, New Haven, CT 06520 USA;

    Tongji Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of full text: English
