首页> 外文OA文献 >Hardware for Fast Global Operations on Distributed Memory Multicomputers and Multiprocessors
【2h】

Hardware for Fast Global Operations on Distributed Memory Multicomputers and Multiprocessors

机译:用于分布式存储器多计算机和多处理器的快速全局操作的硬件

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

u22Grand Challengeu22 problems such as climate modeling to predict droughts and human genome mapping to predict and possibly cure diseases such as cancer require massive computing power. Three kinds of computer systems currently used in attempts to solve these problems are u22Big Ironu22 multicomputers such as the Intel Paragon, workstation cluster multicomputers, and distributed shared memory multiprocessors such as the Cray T3D. Machines such as these are inefficient in executing some or all of a set of global program operations which are important in many of the u22Grand Challengeu22 programs. These operations include synchronization, reduction, MAX, MIN, one-to-all broadcasting, all-to-all broadcasting, and orderly access to global shared variables. My hypothesis was that a secondary network with a wide tree topology and one or more centralized processors optimized for these operations could substantially decrease their execution time on all three types of systems. To test my hypothesis, I developed the secondary network and Coordination Processor(COP) system described in this dissertation, modeled the major blocks of the design in VHDL, and simulated these blocks to verify their logic and get realistic timing values. The analyses developed for the COP system clearly demonstrate that it can speed up a variety of common global operations by as much as 2-3 orders of magnitude when added to any of several current multicomputers and multiprocessors. Examples show that this speedup reduces overall execution time for important scientific programs and computational kernels by an average of 25% at an increase in system cost of only about 2%. Further analyses show that for these global operations the COP system has a greater combination of speed and versatility than any other system.
机译:巨大挑战诸如气候模型以预测干旱和人类基因组图谱以预测并可能治愈诸如癌症的疾病等问题需要大量的计算能力。当前用于解决这些问题的三种计算机系统是Intel Paragon等多计算机,工作站集群多计算机和Cray T3D等分布式共享内存多处理器。诸如此类的机器在执行一组全局程序操作中的某些或全部过程中效率低下,而这在许多“宏伟挑战”程序中很重要。这些操作包括同步,缩减,MAX,MIN,一对一广播,所有对广播以及对全局共享变量的有序访问。我的假设是,具有宽树拓扑的辅助网络以及针对这些操作进行了优化的一个或多个集中处理器可以大大减少它们在所有三种类型的系统上的执行时间。为了验证我的假设,我开发了本文描述的辅助网络和协调处理器(COP)系统,在VHDL中对设计的主要模块进行了建模,并对这些模块进行了仿真,以验证其逻辑并获得实际的时序值。为COP系统开发的分析清楚地表明,将COP系统添加到当前的多台多计算机和多处理器中的任何一个上,可以将各种通用的全局操作速度提高2-3个数量级。实例表明,这种加速将重要的科学程序和计算内核的总体执行时间平均缩短了25%,而系统成本仅增加了约2%。进一步的分析表明,对于这些全局操作,COP系统比其他系统具有更快的速度和多功能性。

著录项

  • 作者

    Hall Douglas V.;

  • 作者单位
  • 年度 1995
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号