【24h】

Configurable Data Memory For Multimedia Processing

机译:用于多媒体处理的可配置数据存储器

获取原文
获取原文并翻译 | 示例

摘要

In modern multimedia applications, memory bottleneck can be alleviated with special stride data accesses. Data elements in stride access can be retrieved in parallel with parallel memories, in which the idea is to increase memory bandwidth with several memory modules working in parallel and feed the processor with only necessary data. Arbitrary stride access capability with interleaved memories is described in previous research where the skewing scheme is changed at run time according to the currently used stride. This paper presents the improved schemes which are adapted to parallel memories. The proposed novel parallel memory implementation allows conflict free accesses with all the constant strides which has not been possible in prior application specific parallel memories. Moreover, the possible access locations are unrestricted and the accessed data element count equals to the number of memory modules. Timing and area estimates are given for Altera Stratix FPGA and 0.18 micrometer CMOS process with memory module count from 2 to 32. The FPGA results show 129 MHz clock frequency for a system with 16 memory modules when read and write latencies are 3 and 2 clock cycles, respectively. The complexity of the proposed system is shown to be a trade-off between application specific and highly configurable parallel memory system.
机译:在现代多媒体应用中,可以通过特殊的跨步数据访问来缓解存储瓶颈。大步访问中的数据元素可以与并行存储器并行检索,其思想是通过几个并行工作的存储器模块来增加存储器带宽,并仅向处理器提供必要的数据。在先前的研究中描述了具有交错存储器的任意步幅访问能力,其中,根据当前使用的步幅在运行时更改倾斜方案。本文提出了适用于并行存储器的改进方案。所提出的新颖的并行存储器实现允许与所有恒定步幅的无冲突访问,这在现有的专用并行存储器中是不可能的。此外,可能的访问位置不受限制,并且所访问的数据元素计数等于存储模块的数量。给出了Altera Stratix FPGA和0.18微米CMOS工艺的时序和面积估计,其中存储模块数为2到32。FPGA结果显示,当读写延迟分别为3和2个时钟周期时,具有16个存储模块的系统的时钟频率为129 MHz , 分别。所提出的系统的复杂性被证明是在专用和高度可配置的并行存储系统之间的权衡。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号