Affinity-aware HPC applications in multichip and multicore multiprocessor

机译：Multichip和Multicore MultiProcessor中的亲和感知HPC应用程序

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Introducing multi level cache memory reduces the gap between the CPU and main memory and speeds up the program execution. The speedup in modern multiprocessors can scale up to linear speedup according to Gustafson's law. Each CPU core usually possesses private L1 and L2 cache memory and shares L3 cache memory in multi-core processor architectures. Furthermore, private or shared cache memory could have significant impact to the algorithm performance in parallel implementation. Private cache increases the overall cache size used during the execution. On the other hand, shared cache reduces cache misses if all CPU cores use the same data. In this paper we analyze the matrix vector multiplication algorithm performance for sequential and parallel implementation in multi-chip multi-core multiprocessor in order to determine the CPU affinity that provides the best performance. We also realize theoretical analysis to determine the problem size regions where selecting appropriate CPU affinity can produce the best performance using the same resources.

机译：引入多级缓存存储器可降低CPU和主存储器之间的间隙，并加快程序执行。根据Gustafson的法律，现代多处理器的加速可以扩展到线性加速。每个CPU内核通常具有私有L1和L2高速缓冲存储器，并在多核处理器架构中共享L3高速缓冲存储器。此外，私有或共享缓存内存可能对并行实现中的算法性能产生显着影响。私有缓存会增加执行期间使用的整体高速缓存大小。另一方面，如果所有CPU核心使用相同的数据，则共享缓存可减换缓存未命中。在本文中，我们分析了多芯片多核多核多处理器顺序和并行实现的矩阵矢量乘法算法性能，以确定提供最佳性能的CPU亲和力。我们还实现了理论分析来确定选择适当的CPU亲和力的问题大小区域可以使用相同资源产生最佳性能。

著录项

来源
《International Conference on Information Technology Interfaces》|2013年||共6页
会议地点
作者
Velkoski Goran; Ristov Sasko; Gusev Marjan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G202-53;
关键词
CPU Cache; Matrix - Vector Multiplication; Performance; Speed;

机译：CPU缓存;矩阵 - 矢量乘法;性能;速度;

相似文献

外文文献
中文文献
专利

1. Optimizing multi-tier application performance with interference and affinity-aware placement algorithms [J] . Uillian L. Ludwig, Miguel G. Xavier, Dionatra F. Kirchoff, Concurrency, practice and experience . 2019,第18期

机译：使用干扰和亲和力感知放置算法优化多层应用程序性能
2. Optimizing multi-tier application performance with interference and affinity-aware placement algorithms [J] . Uillian L. Ludwig, Miguel G. Xavier, Dionatra F. Kirchoff, Concurrency, practice and experience . 2019,第18期

机译：使用干扰和亲和感知放置算法优化多层应用程序性能
3. Cooling of Multicore Multiprocessors [J] . Krishnamachar Sreenivasan Journal of Energy, Heat and Mass Transfer . 2015,第1a4期

机译：多核多处理器的冷却
4. Affinity-aware HPC applications in multichip and multicore multiprocessor [C] . Velkoski Goran, Ristov Sasko, Gusev Marjan 35th International Conference on Information Technology Interfaces : Research and Education using Mobile and Social Networking: When, Where, and How . 2013

机译：多芯片和多核多处理器中具有亲和力的HPC应用程序
5. Performance projections of HPC applications on chip multiprocessor (CMP) based systems [D] . Sharkawi, Sameh Sh Shawky 2011

机译：基于芯片多处理器（CMP）的系统上HPC应用程序的性能预测
6. Time-energy measured data on modern multicore systems running shared-memory applications [O] . Dumitrel Loghin, Yong Meng Teo 2019

机译：运行共享内存应用程序的现代多核系统上的时间能量测量数据
7. Performance Projections of HPC Applications on Chip Multiprocessor (CMP) Based Systems [O] . Shawky Sharkawi Sameh Sh 2011

机译：基于芯片多处理器（CMP）的系统上HPC应用程序的性能预测

Affinity-aware HPC applications in multichip and multicore multiprocessor

摘要

著录项

相似文献

相关主题

期刊订阅