【24h】

Weighted-Tuple Synchronization for Parallel Architecture Simulators

机译:并行体系结构模拟器的加权元组同步

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Simulation is a critical tool for evaluating processor and program performance and behavior in newly proposed computer architectures. When modeling target machines with hundreds or thousands of cores, parallel simulation approaches are an increasingly popular method to reduce the long simulation times inherent in single-threaded simulation. Unfortunately, synchronization forces a tradeoffs between performance and fidelity in these parallel simulators. In this work, we study the link between synchronization violations and architectural metric error in the form of CPI error. Further, we introduce weighted-tuple synchronization, a new distributed synchronization scheme that improves error-delay for parallel simulation. Each core periodically selects a group of synchronization targets, forming a synchronization tuple. The lead core then waits for the other cores to catch up. Selection occurs randomly, but is weighted to favor cores which cause more synchronization violations. With weighted-tuple synchronization and a synchronization interval of 100 cycles, average error delay improves over barrier synchronization by 41% and over random-pair synchronization by 35%.
机译:仿真是评估新提出的计算机体系结构中的处理器和程序性能以及行为的关键工具。在对具有数百或数千个内核的目标机器进行建模时,并行仿真方法是一种越来越流行的方法,可以减少单线程仿真中固有的长仿真时间。不幸的是,在这些并行模拟器中,同步迫使性能和保真度之间进行权衡。在这项工作中,我们以CPI错误的形式研究了同步违规和体系结构度量错误之间的联系。此外,我们引入了加权元组同步,这是一种新的分布式同步方案,可以改善并行仿真的错误延迟。每个内核定期选择一组同步目标,从而形成一个同步元组。然后,引导核心等待其他核心追赶。选择是随机发生的,但会被加权以偏爱会导致更多同步违规的核心。通过加权元组同步和100个周期的同步间隔,平均错误延迟比屏障同步提高了41%,与随机对同步提高了35%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号