首页> 外文会议>Design, Automation Test in Europe Conference Exhibition >Pegasus: Efficient data transfers for PGAS languages on non-cache-coherent many-cores
【24h】

Pegasus: Efficient data transfers for PGAS languages on non-cache-coherent many-cores

机译:Pegasus:在非缓存一致的多核上针对PGAS语言的高效数据传输

获取原文

摘要

To improve scalability, some many-core architectures abandon global cache coherence, but still provide a shared address space. Partitioning the shared memory and communicating via messages is a safe way of programming such machines. However, accessing pointered data structures from a foreign memory partition is expensive due to the required serialization. In this paper, we propose a novel data transfer technique that avoids serialization overhead for pointered data structures by managing cache coherence in software at object granularity. We show that for PGAS programming languages, the compiler and runtime system can completely handle the necessary cache management, thus requiring no changes to application code. Moreover, we explain how cache operations working on address ranges complement our data transfer technique. We propose a novel non-blocking implementation of range-based cache operations by offloading them to an enhanced cache controller. We evaluate our approach on a non-cache-coherent many-core architecture using a distributed-kernel benchmark suite and demonstrate a reduction of communication time of up to 39.8%.
机译:为了提高可伸缩性,某些多核体系结构放弃了全局缓存一致性,但仍提供了共享的地址空间。对共享内存进行分区并通过消息进行通信是对此类机器进行编程的一种安全方法。但是,由于需要序列化,因此从外部存储器分区访问指针数据结构的成本很高。在本文中,我们提出了一种新颖的数据传输技术,该技术通过在对象粒度下管理软件中的缓存一致性,避免了指针数据结构的序列化开销。我们证明,对于PGAS编程语言,编译器和运行时系统可以完全处理必要的缓存管理,因此无需更改应用程序代码。此外,我们解释了在地址范围上进行的缓存操作如何补充我们的数据传输技术。通过将它们卸载到增强型缓存控制器,我们提出了一种基于范围的缓存操作的新型非阻塞实现。我们使用分布式内核基准测试套件在非缓存一致的多核体系结构上评估了我们的方法,并证明通信时间最多可减少39.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号