International Symposium on Microarchitecture

Application-Transparent Near-Memory Processing Architecture with Memory Channel Network



Abstract

The physical memory capacity of servers is expected to increase drastically with deployment of the forthcoming non-volatile memory technologies. This is a welcome improvement for emerging data-intensive applications. Nonetheless, for such servers to be cost-effective, we must cost-effectively increase compute throughput and memory bandwidth commensurate with the increase in memory capacity without compromising application readiness. Tackling this challenge, we present the Memory Channel Network (MCN) architecture in this paper. Specifically, first, we propose an MCN DIMM, an extension of a buffered DIMM in which a small but capable processor called the MCN processor is integrated with a buffer device on the DIMM for near-memory processing. Second, we implement device drivers to give the host and MCN processors in a server the illusion that they are independent heterogeneous nodes connected through an Ethernet link. These allow the host and MCN processors in a server to run a given data-intensive application together based on popular distributed computing frameworks such as MPI and Spark without any change in the host processor hardware or its application software, while offering the benefits of high-bandwidth and low-latency communication between the host and the MCN processors over memory channels. As such, MCN can serve as an application-transparent framework which can seamlessly unify near-memory processing within a server and distributed computing across such servers for data-intensive applications. Our simulation running the full software stack shows that a server with 8 MCN DIMMs offers 4.56X higher throughput and consumes 47.5% less energy than a cluster with 9 conventional nodes connected through Ethernet links, as it facilitates up to 8.17X higher aggregate DRAM bandwidth utilization. Lastly, we demonstrate the feasibility of MCN with an IBM POWER8 system and an experimental buffered DIMM.
