Support for High-Frequency Streaming in CMPs

机译：支持CMP中的高频流

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As the industry moves toward larger-scale chip multiprocessors, the need to parallelize applications grows. High inter-thread communication delays, exacerbated by over-stressed high-latency memory subsystems and ever-increasing wire delays, require parallelization techniques to create partially or fully independent threads to improve performance. Unfortunately, developers and compilers alike often fail to find sufficient independent work of this kind. Recently proposed pipelined streaming techniques have shown significant promise for both manual and automatic parallelization. These techniques have wide-scale applicability because they embrace inter-thread dependences (albeit acyclic dependences) and tolerate long-latency communication of these dependences. This paper addresses the lack of architectural support for this type of concurrency, which has blocked its adoption and hindered related language and compiler research. We observe that both manual and automatic techniques create high-frequency streaming threads, with communication occurring every 5 to 20 instructions. Even while easily tolerating inter-thread transit delays, high-frequency communication makes thread performance very sensitive to intrathread delays from the repeated execution of the communication operations. Using this observation, we define the design-space and evaluate several mechanisms to find a better trade-off between performance and operating system, hardware, and design costs. From this, we find a light-weight streaming-aware enhancement to conventional memory subsystems that doubles the speed of these codes and is within 2% of the best-performing, but heavy-weight, hardware solution.

机译：随着行业朝着大规模芯片多处理器发展，并行化应用程序的需求也在增长。高线程间通信延迟会因过高的高延迟内存子系统以及不断增加的连线延迟而加剧，需要并行化技术来创建部分或完全独立的线程以提高性能。不幸的是，开发人员和编译人员都常常找不到足够的这种独立工作。最近提出的流水线流技术已显示出对手动和自动并行化的巨大希望。这些技术具有广泛的适用性，因为它们包含线程间依赖性（尽管是非循环依赖性），并且可以容忍这些依赖性的长时延通信。本文解决了这种并发缺乏架构支持的问题，这阻碍了它的采用，并阻碍了相关语言和编译器的研究。我们观察到，手动和自动技术都会创建高频流线程，每5到20条指令就会进行一次通信。即使容易容忍线程间传输延迟，高频通信也使线程性能对通信操作重复执行产生的线程内延迟非常敏感。使用此观察，我们定义了设计空间并评估了几种机制，以在性能与操作系统，硬件和设计成本之间找到更好的折衷方案。由此，我们发现传统存储子系统的轻量级流感知增强功能使这些代码的速度提高了一倍，并且在性能最佳但重量较重的硬件解决方案的2％之内。

著录项

来源
《Annual IEEE/ACM International Symposium on Microarchitecture;IEEE/ACM International Symposium on Microarchitecture》|2006年|P.259-272|共14页
会议地点
作者
Ram Rangan; Neil Vachharajani; Adam Stoler; Guilherme Ottoni; David I. August; George Z. N. Cai; PRam Rangan; PGuilherme Ottoni;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类总体结构、系统结构;
关键词

相似文献

外文文献
中文文献
专利

1. Dielectric analysis of CMPS-supported ionic liquids (ILs) microspheres in model gasoline by means of dielectric relaxation spectroscopy [J] . Han M., Chen M., Wan H., Colloids and Surfaces, A. Physicochemical and Engineering Aspects . 2013,第Null期

机译：介电弛豫光谱法分析模型汽油中CMPS负载的离子液体（ILs）微球的介电性能
2. Supporting Microthread Scheduling and Synchronisation in CMPs [J] . Ian Bell, Nabil Hasasneh, Chris Jesshope International journal of parallel programming . 2006,第4期

机译：在CMP中支持微线程调度和同步
3. Data stream processing in HPC systems: New frameworks and architectures for high-frequency streaming [J] . Aldinucci Marco, Cardellini Valeria, Mencagli Gabriele, Parallel Computing . 2020,第Octa期

机译：HPC系统中的数据流处理：用于高频流的新框架和架构
4. A Low-overhead Dedicated Execution Support for Stream Applications on Shared-memory CMP [C] . Paul Dubrulle, Stephane Louise, Renaud Sirdey, ACM international conference on embedded software . 2012

机译：对共享内存CMP上的流应用程序的低开销专用执行支持
5. Pulse of the stream: Evaluation of high-frequency dissolved organic carbon and nitrate concentrations in stream water for diel and autumn periods [D] . Winters, Catherine Grace. 2016

机译：溪流脉动：评估diel和秋季期间溪流水中的高频溶解有机碳和硝酸盐浓度
6. Knowledge discovery from high-frequency stream nitrate concentrations: hydrology and biology contributions [O] . Alice H. Aubert, Michael C. Thrun, Lutz Breuer, -1

机译：高频硝酸盐浓度下的知识发现：水文学和生物学贡献
7. Comparative evaluation of DHDECMP (dihexyl-N,N-diethylcarbamoyl-methylphosphonate) and CMPO (octylphenyl-N,N,-diisobutylcarbamoylmethylphosphine oxide) as extractants for recovering actinides from nitric acid waste streams [O] . S.F. Marsh, S.L. Yarbro 1988

机译：DHDECMP（二己基-N，N-二乙基氨基甲酰基 - 甲基膦酸酯）和CMPO（辛基苯基-N，N，-diisbutylcarbamoylmoshys氧化物）作为回收硝酸废物流的萃取剂的比较评价
8. Comparative Evaluation of DHDECMP (Dihexyl-N,N-Diethylcarbamoyl-Methylphosphonate) and CMPO (Octylphenyl-N,N,-Diisobutylcarbamoylmethylphosphine Oxide) as Extractants for Recovering Actinides from Nitric Acid Waste Streams [R] . Marsh, S. F. , Yarbro, S. L. 1988

机译：DHDECmp（二己基-N，N-二乙基氨基甲酰基 - 甲基膦酸酯）和CmpO（辛基苯基-N，N， - 二异丁基氨基甲酰基甲基膦氧化物）作为从硝酸废物流中回收act系元素的萃取剂的对比评价

Support for High-Frequency Streaming in CMPs

摘要

著录项

相似文献

相关主题

期刊订阅