首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium Workshops >Parallel Implementation Strategies for Hierarchical Non-Uniform Memory Access Systems by Example of the Scale-Invariant Feature Transform Algorithm
【24h】

Parallel Implementation Strategies for Hierarchical Non-Uniform Memory Access Systems by Example of the Scale-Invariant Feature Transform Algorithm

机译:按比例不变特征变换算法的示例的分层非统一内存访问系统的并行实现策略

获取原文

摘要

The domains of parallel and distributed computing have been converging continuously up to the degree that state-of-the-art server computer systems incorporate characteristics from both domains: They comprise a hierarchy of enclosures, where each enclosure houses multiple processor sockets and each socket again contains multiple memory controllers. A global address space and cache coherency are facilitated using multiple layers of fast interconnection technologies even across enclosures. The growing popularity of such systems creates an urge for efficient mappings of cardinal algorithms onto such hierarchical architectures. However, the growing complexity of such systems and the inconsistencies between implementation strategies of different hardware vendors make it increasingly harder to do find efficient mapping strategies that are universally valid. In this paper, we present scalable optimization and mapping strategies in a case study of the popular Scale-Invariant Feature Transform (SIFT) computer vision algorithm. Our approaches are evaluated using a state-of-the-art hierarchical Non-Uniform Memory Access (NUMA) system with 240 physical cores and 12 terabytes of memory, apportioned across 16 NUMA nodes (sockets). SIFT is particularly interesting since the algorithm utilizes a variety of common data access patterns, thus allowing us to discuss the scaling properties of optimization strategies from the distributed and parallel computing domains and their applicability on emerging server systems.
机译:平行的且所述域分布式计算已收敛连续到该状态的最先进的服务器计算机系统结合特性从两个域的程度:它们包括外壳的层次结构,其中,每个外壳容纳多个处理器插槽和每个插座再次包含多个存储器控制器。全局地址空间和高速缓存一致性使用甚至跨磁盘阵列快速互连技术的多层便利。这种系统的日益普及,创建的基数算法的高效映射到这种层次架构的冲动。然而,这种系统的日益复杂和不同的硬件厂商实现战略之间的矛盾使得越来越难确实发现有效映射策略是普遍有效的。在本文中,我们在流行的尺度不变特征转换(SIFT)计算机视觉算法的情况下,研究提出可扩展的优化和映射策略。我们的方法使用的是有240个物理核心和存储器12兆兆字节的状态的最先进的分层非均匀存储器访问(NUMA)系统中,在16个NUMA节点(插座)分配进行评价。 SIFT是特别有趣,因为算法利用各种常见的数据访问模式,从而使我们能够从分布式讨论的优化策略的缩放属性和并行计算域和它们对新出现的服务器系统的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号