首页> 外文学位 >Associative Computing with Resistive Memories
【24h】

Associative Computing with Resistive Memories

机译:电阻记忆的关联计算

获取原文
获取原文并翻译 | 示例

摘要

Power dissipation and memory bandwidth are significant performance bottlenecks in virtually all computer systems. Associative computing with ternary content addressable memories (TCAM) holds the potential to address both problems in a wide range of data intensive workloads. Power dissipation is reduced by eliminating instruction processing and data movement overheads present in a purely RAM based system. Bandwidth demand is lowered by processing data directly on the TCAM chip, thereby decreasing off-chip traffic. Unfortunately, existing SRAM-based TCAM cells are over 90x larger than a DRAM cell at the same technology node, which limits the capacity of commercially available TCAMs to a few megabytes.;This dissertation first examines the integration of gigascale TCAM systems based on resistive memories within a general purpose computing platform. TCAM density is improved by novel, resistive memory cells that exploit phase change and spin-toque transfer magnetoresistive RAM technologies. TCAM chips are organized into a DDR3-compatible DIMM, and are accessed through a software library with zero modifications to the processor or the motherboard. Leveraging associative lookups by the memory circuits and a set of integrated, programmable microcontrollers that execute user-defined kernels on the search results, the proposed TCAM systems achieve average speedups of 4-4.2x and average energy reductions of 6.5-8x as compared to a conventional RAM based system.;The dissertation second presents the case for a database processing unit (DPU) that significantly improves the energy efficiency in big data analytics. The DPU augments a general-purpose processor with 1) an STT-MRAM based scratchpad memory that implements a CAM cell with a 2.45x higher density than an SRAM cell, and 2) an enhanced DMA unit that facilitates the transportation of data between the CAM and memory subsystem. By mapping the relational database operators to the CAM circuits, columns of data are processed directly on the CAM chip in a vectorized manner. This eschews streaming the entire data set in and out of the processor pipeline, and significantly reduces the performance and energy overheads caused by the data movement.
机译:功耗和内存带宽实际上是所有计算机系统中的重要性能瓶颈。具有三态内容可寻址存储器(TCAM)的关联计算具有解决广泛数据密集型工作负载中的两个问题的潜力。通过消除纯粹基于RAM的系统中存在的指令处理和数据移动开销来降低功耗。通过直接在TCAM芯片上处理数据来降低带宽需求,从而减少片外流量。不幸的是,现有的基于SRAM的TCAM单元比同一技术节点上的DRAM单元大90倍以上,这将市售TCAM的容量限制在几兆字节。本文首先研究基于电阻性存储器的千兆级TCAM系统的集成在通用计算平台中。 TCAM密度通过利用相变和自旋矩转移磁阻RAM技术的新型电阻式存储单元得以提高。 TCAM芯片被组织成DDR3兼容的DIMM,并通过对处理器或母板进行零修改的软件库进行访问。利用存储电路和一组集成的可编程微控制器(在搜索结果上执行用户定义的内核)的关联性查找,与传统的TCAM系统相比,拟议的TCAM系统的平均速度提高了4-4.2倍,平均能耗降低了6.5-8倍。传统的基于RAM的系统。论文的第二部分介绍了一种数据库处理单元(DPU)的案例,该单元可显着提高大数据分析中的能源效率。 DPU通过以下方式增强了通用处理器的功能:1)基于STT-MRAM的暂存器存储器,实现了比SRAM单元高2.45倍密度的CAM单元,以及2)便于在CAM之间传输数据的增强的DMA单元和内存子系统。通过将关系数据库运算符映射到CAM电路,可以以矢量化的方式直接在CAM芯片上处理数据列。这避免了将整个数据集流进出处理器流水线,并显着降低了由数据移动引起的性能和能源开销。

著录项

  • 作者

    Guo, Qing.;

  • 作者单位

    University of Rochester.;

  • 授予单位 University of Rochester.;
  • 学科 Computer engineering.;Computer science.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 120 p.
  • 总页数 120
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号