首页> 外文学位 >Coflow: A Networking Abstraction for Distributed Data-Parallel Applications.
【24h】

Coflow: A Networking Abstraction for Distributed Data-Parallel Applications.

机译:Coflow:分布式数据并行应用程序的网络抽象。

获取原文
获取原文并翻译 | 示例

摘要

Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise of cloud computing has fundamentally transformed systems software and corresponding infrastructure. To deal with massive datasets, more and more applications today are scaling out to large datacenters. These distributed data-parallel applications run on tens to thousands of machines in parallel to exploit I/O parallelism, and they enable a wide variety of use cases, including interactive analysis, SQL queries, machine learning, and graph processing.;Communication between the distributed computation tasks of these applications often result in massive data transfers over the network. Consequently, concentrated efforts in both industry and academia have gone into building high-capacity, low-latency datacenter networks at scale. At the same time, researchers and practitioners have proposed a wide variety of solutions to minimize flow completion times or to ensure per-flow fairness based on the point-to-point flow abstraction that forms the basis of the TCP/IP stack.;We observe that despite rapid innovations in both applications and infrastructure, application- and network-level goals are moving further apart. Data-parallel applications care about all their flows, but today's networks treat each point-to-point flow independently. This fundamental mismatch has resulted in complex point solutions for application developers, a myriad of configuration options for end users, and an overall loss of performance.;The key contribution of this dissertation is bridging this gap between application-level performance and network-level optimizations through the coflow abstraction. Each multipoint-to-multipoint coflow represents a collection of flows with a common application-level performance objective, enabling application-aware decision making in the network. We describe complete solutions including architectures, algorithms, and implementations that apply coflows to multiple scenarios using central coordination, and we demonstrate through large-scale cloud deployments and trace-driven simulations that simply knowing how flows relate to each other is enough for better network scheduling, meeting more deadlines, and providing higher performance isolation than what is otherwise possible using today's application-agnostic solutions.;In addition to performance improvements, coflows allow us to consolidate communication optimizations across multiple applications, simplifying software development and relieving end users from parameter tuning. On the theoretical front, we discover and characterize for the first time the concurrent open shop scheduling with coupled resources family of problems. Because any flow is also a coflow with just one flow, coflows and coflow-based solutions presented in this dissertation generalize a large body of work in both networking and scheduling literatures.
机译:在过去的十年中,前所未有的数据量增长与云计算的迅速崛起的融合,从根本上改变了系统软件和相应的基础架构。为了处理海量数据集,当今越来越多的应用程序正在扩展到大型数据中心。这些分布式数据并行应用程序可并行运行在数以万计的计算机上,以利用I / O并行性,并且它们支持多种用例,包括交互式分析,SQL查询,机器学习和图形处理。这些应用程序的分布式计算任务通常会导致大量数据通过网络传输。因此,业界和学术界都在集中精力大规模构建高容量,低延迟的数据中心网络。同时,研究人员和从业人员提出了各种各样的解决方案,以基于构成TCP / IP堆栈基础的点对点流抽象,最小化流完成时间或确保每流公平性。观察到,尽管在应用程序和基础架构方面都进行了快速的创新,但是应用程序和网络级的目标却在进一步地相互偏离。数据并行应用程序关心它们的所有流,但是当今的网络独立地对待每个点对点流。这种根本的不匹配导致为应用程序开发人员提供了复杂的点解决方案,为最终用户提供了无数的配置选项,并且总体上降低了性能。本论文的主要贡献在于弥合了应用程序级性能和网络级优化之间的差距。通过同流抽象。每个多点到多点同流表示具有公共应用程序级性能目标的流的集合,从而使网络中的应用程序感知决策成为可能。我们描述了完整的解决方案,包括使用中心协调将并流应用于多个场景的体系结构,算法和实现,并且我们通过大规模的云部署和跟踪驱动的模拟进行了演示,证明仅了解流之间的相互关系足以进行更好的网络调度,与目前与应用程序不可知的解决方案相比,可以满足更多的截止日期,并提供更高的性能隔离。除了性能方面的改进之外,同流技术还使我们能够整合多个应用程序之间的通信优化,简化软件开发并减轻最终用户的参数调整负担。在理论上,我们首次发现并描述了具有耦合资源问题的并发开店调度。由于任何流也是只有一个流的同流,因此本文提出的基于同流和基于同流的解决方案概括了网络和调度文献中的大量工作。

著录项

  • 作者单位

    University of California, Berkeley.;

  • 授予单位 University of California, Berkeley.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 187 p.
  • 总页数 187
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号