首页> 外文期刊>Parallel Computing >Controlling NUMA effects in embedded manycore applications with lightweight nested parallelism support
【24h】

Controlling NUMA effects in embedded manycore applications with lightweight nested parallelism support

机译:支持轻量级嵌套并行性,可在嵌入式多核应用程序中控制NUMA效果

获取原文
获取原文并翻译 | 示例

摘要

Embedded manycore architectures are often organized as fabrics of tightly-coupled shared memory clusters. A hierarchical interconnection system is used with a crossbar-like medium inside each cluster and a network-on-chip (NoC) at the global level which make memory operations nonuniform (NUMA). Due to NUMA, regular applications typically employed in the embedded domain (e.g., image processing, computer vision, etc.) ultimately behave as irregular workloads if a flat memory system is assumed at the program level. Nested parallelism represents a powerful programming abstraction for these architectures, provided that (i) streamlined middleware support is available, whose overhead does not dominate the run-time of fine-grained applications; (ii) a mechanism to control thread binding at the cluster-level is supported. We present a lightweight runtime layer for nested parallelism on cluster-based embedded manycores, integrating our primitives in the OpenMP runtime system, and implementing a new directive to control NUMA-aware nested parallelism mapping. We explore on a set of real application use cases how NUMA makes regular parallel workloads behave as irregular, and how our approach allows to control such effects and achieve up to 28 x speedup versus flat parallelism. (C) 2016 Elsevier B.V. All rights reserved.
机译:嵌入式多核体系结构通常组织为紧密耦合的共享内存集群的结构。分层互连系统与每个群集内部的纵横制介质以及全局级别的片上网络(NoC)配合使用,从而使内存操作不一致(NUMA)。由于NUMA,如果在程序级别假定使用平面存储系统,则通常在嵌入式领域中使用的常规应用程序(例如,图像处理,计算机视觉等)最终会充当不规则的工作负载。嵌套的并行机制代表了这些体系结构的强大编程抽象,前提是:(i)可以使用简化的中间件支持,其开销不会控制细粒度应用程序的运行时; (ii)支持在集群级别控制线程绑定的机制。我们为基于集群的嵌入式多核上的嵌套并行性提供了一个轻量级的运行时层,将我们的原语集成到OpenMP运行时系统中,并实现了一个新指令来控制可识别NUMA的嵌套并行性映射。我们在一组实际的应用程序用例中探讨了NUMA如何使常规并行工作负载表现为不规则行为,以及我们的方法如何允许控制此类影响并实现比平面并行性高多达28倍的加速。 (C)2016 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Parallel Computing》 |2016年第11期|24-42|共19页
  • 作者单位

    Swiss Fed Inst Technol Zurich ETHZ, D ITET, Gloriastr 35, CH-8092 Zurich, Switzerland|Univ Bologna, DEI, Wale Risorgimento 2, I-40136 Bologna, Italy;

    Univ Bologna, DEI, Wale Risorgimento 2, I-40136 Bologna, Italy;

    Swiss Fed Inst Technol Zurich ETHZ, D ITET, Gloriastr 35, CH-8092 Zurich, Switzerland|Univ Bologna, DEI, Wale Risorgimento 2, I-40136 Bologna, Italy;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Manycores; Nested parallelism; OpenMP;

    机译:Manycores;嵌套并行性;OpenMP;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号