...
首页> 外文期刊>Journal of Electrical and Electronic Engineering >Analysis of World Experience in Creating Parallel Computing Systems Designed to Effectively Solve DIS-tasks
【24h】

Analysis of World Experience in Creating Parallel Computing Systems Designed to Effectively Solve DIS-tasks

机译:分析创建旨在有效解决DIS任务的并行计算系统的世界经验

获取原文
           

摘要

Author describes world experience in creating parallel computing systems by example Cray XE6 and network chip Gemini, designed to effectively solve Data intensive tasks (DIS-tasks). Most often, in modern supercomputers (SC), architecture options with shared (shared) memory are used to provide effective solutions to problems of high capacitive complexity, including those that contain mostly irregular work with memory. It is possible to provide support for a programming model with shared (shared) memory in various ways using hardware, as well as using virtualization software. Different options for implementing a shared memory programming model may vary in functionality and timing of memory accesses. The problem of the "memory wall" is that if arithmetic-logical operations take several processor cycles, then operations directly with the memory take several hundred cycles. If the memory is formed from the memories of computing nodes connected by a communication network, then the execution time of such a call includes the time of operation with the network to transfer addresses and data. This already increases the memory access time to several thousand cycles. The problem is that such delays in accessing data cause idle functional units of the processor - they cannot perform arithmetic and logical operations on data, because they simply do not exist due to the large delays in performing operations with memory.
机译:作者通过示例Cray XE6和网络芯片Gemini描述了在创建并行计算系统方面的世界经验,这些示例旨在有效解决数据密集型任务(DIS-tasks)。大多数情况下,在现代超级计算机(SC)中,具有共享(共享)内存的体系结构选项用于提供有效的解决方案,以解决高电容复杂性的问题,包括那些大多数包含不规则的内存工作的问题。可以使用硬件以及使用虚拟化软件以各种方式为具有共享(共享)内存的编程模型提供支持。用于实现共享内存编程模型的不同选项可能会在功能和内存访问时间方面有所不同。 “内存隔离墙”的问题在于,如果算术逻辑运算需要几个处理器周期,那么直接对内存的运算就需要数百个周期。如果该存储器是由通过通信网络连接的计算节点的存储器形成的,则这种调用的执行时间包括与网络进行操作以传输地址和数据的时间。这已经将内存访问时间增加到数千个周期。问题在于访问数据的这种延迟会导致处理器的空闲功能单元-它们无法对数据执行算术和逻辑运算,因为由于使用内存执行操作的延迟较大,因此它们根本不存在。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号