Conference: Euromicro Workshop on Parallel and Distributed Processing

Evaluation of a virtual shared memory machine by the compilation of data parallel loops

Abstract

We introduce DVSA (distributed virtual shared areas), a virtual machine supporting the sharing of information on distributed memory architectures. The shared memory is structured as a set of areas, where the size of each area may be chosen within an architecture-dependent range. DVSA supports the sharing of areas rather than of variables because exchanging chunks of data may result in better performance on distributed memory architectures that offer little or no hardware support for information sharing. DVSA does not implement replication or prefetching strategies, under the assumption that these strategies should be implemented by application-specific virtual machines. The definition of these machines may often be driven by the compilation of the adopted programming languages. To validate this assumption, we first consider the implementation of data parallel loops and show that a set of static analyses based on the closed forms approach makes it possible to define compiler-driven caching and prefetching strategies. These strategies fully exploit the operations offered by the DVSA machine, and they noticeably reduce the time to access shared information. The optimization strategies that can be exploited by the compiler include the merging of accesses to avoid multiple accesses to the same area, the prefetching of areas, and the reduction of the overhead due to barrier synchronization. Preliminary performance figures are discussed.
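
The idea of merging accesses can be illustrated with a small sketch. The abstract does not give the DVSA interface, so every name below (`strided_indices`, `merge_accesses`, `area_size`) is hypothetical, and the assumption that element `i` lives in area `i // area_size` is ours, not the paper's. The sketch also enumerates the loop's indices at run time, whereas the paper describes a static closed forms analysis performed by the compiler; it only shows why grouping accesses by area reduces the number of area fetches.

```python
from collections import defaultdict


def strided_indices(lower, upper, stride, offset=0):
    """Indices touched by a loop `for i in range(lower, upper): A[stride*i + offset]`."""
    return [stride * i + offset for i in range(lower, upper)]


def merge_accesses(indices, area_size):
    """Group element indices by the area assumed to hold them.

    Assumption (not from the paper): element i is stored in area i // area_size.
    Each key of the result is an area that needs to be fetched only once.
    """
    by_area = defaultdict(list)
    for i in indices:
        by_area[i // area_size].append(i)
    return dict(by_area)


# A loop reading A[2*i + 1] for i in 0..7, with hypothetical areas of 4 elements.
indices = strided_indices(0, 8, stride=2, offset=1)
merged = merge_accesses(indices, area_size=4)
print(len(indices), "element accesses,", len(merged), "area fetches")
```

In this example the eight element accesses fall into four areas, so a runtime that fetches whole areas would issue four requests rather than eight. The same per-area grouping could in principle feed a prefetching step, since the set of areas is known before the loop body runs.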
