首页> 外文会议>International Conference on Information Technology Interfaces >Affinity-aware HPC applications in multichip and multicore multiprocessor
【24h】

Affinity-aware HPC applications in multichip and multicore multiprocessor

机译:Multichip和Multicore MultiProcessor中的亲和感知HPC应用程序

获取原文

摘要

Introducing multi level cache memory reduces the gap between the CPU and main memory and speeds up the program execution. The speedup in modern multiprocessors can scale up to linear speedup according to Gustafson's law. Each CPU core usually possesses private L1 and L2 cache memory and shares L3 cache memory in multi-core processor architectures. Furthermore, private or shared cache memory could have significant impact to the algorithm performance in parallel implementation. Private cache increases the overall cache size used during the execution. On the other hand, shared cache reduces cache misses if all CPU cores use the same data. In this paper we analyze the matrix vector multiplication algorithm performance for sequential and parallel implementation in multi-chip multi-core multiprocessor in order to determine the CPU affinity that provides the best performance. We also realize theoretical analysis to determine the problem size regions where selecting appropriate CPU affinity can produce the best performance using the same resources.
机译:引入多级缓存存储器可降低CPU和主存储器之间的间隙,并加快程序执行。根据Gustafson的法律,现代多处理器的加速可以扩展到线性加速。每个CPU内核通常具有私有L1和L2高速缓冲存储器,并在多核处理器架构中共享L3高速缓冲存储器。此外,私有或共享缓存内存可能对并行实现中的算法性能产生显着影响。私有缓存会增加执行期间使用的整体高速缓存大小。另一方面,如果所有CPU核心使用相同的数据,则共享缓存可减换缓存未命中。在本文中,我们分析了多芯片多核多核多处理器顺序和并行实现的矩阵矢量乘法算法性能,以确定提供最佳性能的CPU亲和力。我们还实现了理论分析来确定选择适当的CPU亲和力的问题大小区域可以使用相同资源产生最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号