Journal: IEEE Transactions on Parallel and Distributed Systems

Lightweight Chip Multi-Threading (LCMT): Maximizing Fine-Grained Parallelism On-Chip

Abstract

Irregular and dynamic applications, such as graph problems and agent-based simulations, often require fine-grained parallelism to achieve good performance. However, current multicore processors provide architectural support only for coarse-grained parallelism, so fine-grained parallelism must be implemented through software-based multithreading environments. Although these environments have demonstrated superior performance over heavyweight, OS-level threads, they are still limited by the significant overhead of thread management and synchronization. To address this, we propose a Lightweight Chip Multi-Threaded (LCMT) architecture that further exploits thread-level parallelism (TLP) by incorporating direct architectural support for an "unlimited" number of dynamically created lightweight threads with very low thread management and synchronization overhead. The LCMT architecture can be implemented atop a mainstream architecture with minimal extra hardware to leverage existing legacy software environments. We compare the LCMT architecture with a Niagara-like baseline architecture. For irregular and dynamic benchmarks, our results show up to 1.8X better scalability, 1.91X better performance, and, more importantly, 1.74X better performance per watt than the baseline architecture. For regular benchmarks, the LCMT architecture delivers performance similar to the baseline architecture.
