Journal: Computer Architecture News

PowerChief: Intelligent Power Allocation for Multi-Stage Applications to Improve Responsiveness on Power Constrained CMP

Abstract

Modern user-facing applications consist of multiple processing stages, with a number of service instances in each stage. The latency profile of these multi-stage applications is intrinsically variable, making it challenging to provide satisfactory responsiveness. Given a limited power budget, improving the end-to-end latency requires intelligently boosting the bottleneck service across stages using multiple boosting techniques. However, prior work fails to acknowledge the multi-stage nature of user-facing applications and performs poorly in improving responsiveness on power-constrained CMPs, as it is unable to accurately identify the bottleneck service and apply the boosting techniques adaptively. In this paper, we present PowerChief, a runtime framework that 1) provides a joint design of service and query to monitor the latency statistics across service stages and accurately identify the bottleneck service at runtime; 2) adaptively chooses the boosting technique to accelerate the bottleneck service with improved responsiveness; and 3) dynamically reallocates the constrained power budget across service stages to accommodate the chosen boosting technique. Evaluated with real-world multi-stage applications, PowerChief improves the average latency by 20.3x and 32.4x (99% tail latency by 13.3x and 19.4x) for Sirius and Natural Language Processing applications, respectively, compared to stage-agnostic power allocation. In addition, for a given QoS target, PowerChief reduces the power consumption of Sirius and Web Search applications by 23% and 33%, respectively, over prior work.
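The three steps named in the abstract (identify the bottleneck, choose a boosting technique, reallocate power under a fixed budget) can be illustrated with a minimal sketch. This is not PowerChief's actual algorithm, which the abstract does not specify. Everything here is an assumption made for illustration: the `Stage` fields, the use of mean latency as the bottleneck metric, the two boosting options, the queue-length threshold, and the per-stage power floor.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List


@dataclass
class Stage:
    """Hypothetical per-stage state; field names are illustrative only."""
    name: str
    latencies_ms: List[float]  # recent per-query latency samples
    queue_length: int          # queries currently waiting at this stage
    power_w: float             # power currently allotted to this stage


def find_bottleneck(stages: List[Stage]) -> Stage:
    # Assumption: the bottleneck is the stage with the highest mean latency.
    return max(stages, key=lambda s: mean(s.latencies_ms))


def choose_boost(stage: Stage, queue_threshold: int = 4) -> str:
    # Assumed rule: a long queue suggests adding a service instance,
    # otherwise speed up the existing instance.
    if stage.queue_length > queue_threshold:
        return "add_instance"
    return "raise_frequency"


def reallocate(stages: List[Stage], bottleneck: Stage,
               extra_w: float, floor_w: float = 1.0) -> float:
    """Shift up to extra_w watts to the bottleneck from the other stages.

    The total budget never changes; no donor drops below floor_w.
    Returns the number of watts actually moved.
    """
    needed = extra_w
    # Take power from the fastest (lowest mean latency) stages first.
    donors = sorted((s for s in stages if s is not bottleneck),
                    key=lambda s: mean(s.latencies_ms))
    for donor in donors:
        give = min(needed, max(donor.power_w - floor_w, 0.0))
        donor.power_w -= give
        bottleneck.power_w += give
        needed -= give
        if needed <= 0:
            break
    return extra_w - needed


stages = [
    Stage("stage_a", [120.0, 140.0], queue_length=6, power_w=10.0),
    Stage("stage_b", [30.0, 40.0], queue_length=1, power_w=10.0),
    Stage("stage_c", [60.0, 50.0], queue_length=2, power_w=10.0),
]
bottleneck = find_bottleneck(stages)
boost = choose_boost(bottleneck)
moved = reallocate(stages, bottleneck, extra_w=12.0, floor_w=4.0)
```

In this toy run, the slowest stage receives power from the two faster stages while the total of 30 W stays fixed. A real runtime would repeat the loop as latency statistics change.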
