Computer Communication Review

Speeding up Distributed Request-Response Workflows


Abstract

We found that interactive services at Bing have highly variable datacenter-side processing latencies because their processing consists of many sequential stages, parallelization across 10s-1000s of servers, and aggregation of responses across the network. To improve the tail latency of such services, we use a few building blocks: reissuing laggards elsewhere in the cluster, new policies to return incomplete results, and speeding up laggards by giving them more resources. Combining these building blocks to reduce the overall latency is non-trivial because, for the same amount of resource (e.g., number of reissues), different stages improve their latency by different amounts. We present Kwiken, a framework that takes an end-to-end view of latency improvements and costs. It decomposes the problem of minimizing latency over a general processing DAG into a manageable optimization over individual stages. Through simulations with production traces, we show sizable gains; the 99th percentile of latency improves by over 50% when just 0.1% of the responses are allowed to have partial results, and by over 40% for 25% of the services when just 5% extra resources are used for reissues.
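The first building block, reissuing laggards, can be illustrated with a toy Monte Carlo sketch. This is not Kwiken's actual policy; the heavy-tailed per-server latency distribution and the reissue threshold of 10 time units are assumptions chosen for illustration. The idea: if a request has not returned by the threshold, launch a duplicate copy elsewhere and take whichever copy finishes first, trimming the 99th-percentile latency at the cost of a small fraction of extra requests.

```python
import random


def stage_latency(rng):
    # Assumed heavy-tailed per-server latency: usually fast (Exp(1)),
    # but 1% of requests land on a laggard taking 50+ time units.
    if rng.random() < 0.99:
        return rng.expovariate(1.0)
    return 50.0 + rng.expovariate(0.1)


def p99(reissue_after=None, n=10_000, seed=0):
    """99th-percentile completion time, with an optional reissue threshold."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        t = stage_latency(rng)
        if reissue_after is not None and t > reissue_after:
            # Reissue the laggard on another server; the stage completes
            # when the faster of the two copies finishes.
            t = min(t, reissue_after + stage_latency(rng))
        samples.append(t)
    samples.sort()
    return samples[int(0.99 * n)]


baseline = p99()                     # no reissues
hedged = p99(reissue_after=10.0)     # reissue any request slower than 10 units
```

Under these assumptions the hedged 99th percentile drops well below the baseline, since a reissued copy rarely hits a laggard as well; the abstract's point is that choosing such thresholds independently per stage is suboptimal, which motivates Kwiken's end-to-end optimization across the DAG.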
