首页> 外文会议>Massively parallel processing using optical interconnections >On Some Architectural Issues of Optical Hierarchical Ring Networks for Shared-Memory Multiprocessors
【24h】

On Some Architectural Issues of Optical Hierarchical Ring Networks for Shared-Memory Multiprocessors

机译:共享内存多处理器光分层环网的一些体系结构问题

获取原文
获取原文并翻译 | 示例

摘要

Optical hierarchical ring networks with 2 and 3 levels for multiprocessors are studied through simple analytical modeling and extensive simulations. The performance of the four possible deflection routing schemes to resolve contentions is simulated and found to be relatively the same. Comparison of deflection routing and buffering, under the assumption that each slot contains one bit along the temporal dimension, shows that the transaction delays in systems using deflection routing increase faster than in systems with buffering with an increase in traffic intensity. However, the performance gain by reconfiguring from a 2-level deflection system to a 3-level system is significant, and the gain can outperform buffering in a 2-level system. It is postulated, nevertheless, that deflection routing should outperform the buffering scheme when each slot contains more bits along the temporal dimension, because the o-e and e-o cost of the latter is proportional to the number of bits whereas it is constant for the former. Non-contention optimal configurations are found by minimizing the maximum transaction delay and the average transaction delay. However, when contentions are considered, those configurations that minimize the average non-contention delay perform worse than those which minimize the maximum non-contention delay. The poor performance is the result of quick saturation at the global ring. However, configurations that result from minimizing the maximum or average non-contention delay may be far from the true optimal configuration specific to a particular workload, especially when the traffic load is high, and traffic is localized.
机译:通过简单的分析模型和广泛的仿真,研究了用于多处理器的具有2级和3级光学分层环网。模拟了四种可能的偏转路由方案解决争用的性能,发现它们相对相同。在每个时隙沿时间维度包含一位的假设下,偏转路由和缓冲的比较表明,使用偏转路由的系统中的事务延迟比具有缓冲的系统中的业务延迟随着业务量的增加而增加得更快。但是,通过从2级偏转系统重新配置为3级系统,可以显着提高性能,并且该性能优于2级系统中的缓冲。但是,可以假设,当每个时隙沿时间维度包含更多位时,偏转路由应优于缓冲方案,因为后者的o-e和e-o成本与位的数量成正比,而对于前者则是恒定的。通过最小化最大事务延迟和平均事务延迟来找到非竞争性最佳配置。但是,当考虑竞争时,那些使平均非竞争延迟最小的配置的性能要比使最大非竞争延迟最小的配置的性能差。性能差是全局环快速饱和的结果。但是,最小化最大或平均非竞争延迟所导致的配置可能与特定于特定工作负载的真正最佳配置相去甚远,尤其是在流量负载较高且流量已本地化的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号