首页> 外文期刊>Journal of Parallel and Distributed Computing >A framework for the design, synthesis and cycle-accurate simulation of multiprocessor networks
【24h】

A framework for the design, synthesis and cycle-accurate simulation of multiprocessor networks

机译:多处理器网络的设计,综合和精确周期仿真的框架

获取原文
获取原文并翻译 | 示例
           

摘要

This paper introduces a framework for the design, synthesis and cycle-accurate simulation for parallel computing networks of 128+ processors. In order to accurately characterize the network, we present a bottom-up design methodology in which each of the components are designed using a hardware description language and synthesized to an FPGA for performance estimation of the final ASIC implementation. The components are then integrated to form a parallel computing network and simulated using a cycle-accurate simulator with network traffic described by command files. This enabled us to simulate various switching techniques, three of which are presented in this paper: wormhole switching, circuit switching and a newly introduced technique called predictive circuit switching. In our experiments, four different representational traffics are generated for our simulation and, to show the flexibility of this model, we vary the cable lengths and thus their latency for all four test cases. Our results show that this hardware design, synthesis and cycle-accurate simulation methodology provides a useful method for evaluating design tradeoffs in parallel networks. A non-blocking queue, with up to 128 internal queues, and a real-time bandwidth scheduler, for up to 128 ports, were designed in hardware with hardware synthesis results presented. From our network simulation results, we conclude that predictive circuit switching exceeds the performance of packet switching for highly predictable traffic, like collective communications, and for heavily loaded unpredictable traffic with small packet sizes. As expected, predictive circuit switching significantly underperforms both packet and circuit switching for unpredictable traffic.
机译:本文介绍了用于128个以上处理器的并行计算网络的设计,综合和精确周期仿真的框架。为了准确地表征网络,我们提出了一种自下而上的设计方法,其中,每个组件都使用硬件描述语言进行设计,并合成到FPGA中以评估最终ASIC实现的性能。然后将这些组件集成在一起,以形成一个并行计算网络,并使用周期精确的仿真器对仿真进行仿真,并使用命令文件描述的网络流量。这使我们能够仿真各种开关技术,本文介绍了其中的三种:虫洞开关,电路开关和一种新引入的技术,称为预测电路开关。在我们的实验中,为我们的仿真生成了四种不同的代表性流量,并且为了显示该模型的灵活性,我们改变了电缆长度,并因此改变了所有四个测试用例的等待时间。我们的结果表明,这种硬件设计,综合和周期精确的仿真方法为评估并行网络中的设计折衷提供了一种有用的方法。在硬件中设计了一个具有多达128个内部队列的非阻塞队列和一个多达128个端口的实时带宽调度程序,并给出了硬件综合结果。从我们的网络仿真结果中,我们得出结论,对于高度可预测的流量(如集体通信)以及对于负载量较小的数据包,不可预测的流量,预测电路交换的性能超过了数据包交换的性能。不出所料,在不可预测的流量方面,预测电路交换的性能明显不及数据包交换和电路交换。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号