【24h】

A Multi-ported Memory Compiler Utilizing True Dual-Port BRAMs

机译:利用真正的双端口BRAM的多端口内存编译器

获取原文
获取原文并翻译 | 示例

摘要

Recent work has shown how multi-ported RAMs can be built out of dual-ported RAMs. Such techniques combine two structures: a set of "data banks" to hold the data, and a method for selecting the bank containing the last-written data, often called a live-value table (LVT). Most previous work has focused on the design of the LVT to reduce area and improve performance. In this paper, we instead reduce area by optimizing the design of the "data banks" portion. The optimization is embedded into a memory compiler that solves a set cover problem. When the set cover problem is solved optimally, the data banks use minimum area. Our technique applies to multi-ported RAMs that have a structural pattern we describe as "switched ports". Switched ports are a generalization of true ports, where a certain number of write ports can be dynamically switched into a possibly different number of read ports using one common read/write control signal. Furthermore, a given application may have multiple sets, each set with a different read/write control. While previous work generates multi-port RAM solutions that contain only true ports, or only simple ports, we contend that using only these two models is too limiting and prevents optimizations from being applied. Experimental results on 10 random instances of multi-port RAMs show 17% BRAM reduction on average compared to the best of other approaches. The compiler and a fully parameterized Verilog implementation is released as an open source library. The library has been extensively tested using Altera's EDA tools.
机译:最近的工作表明如何从双端口RAM中构建多端口RAM。这种技术结合了两种结构:一组用于保存数据的“数据库”,以及一种用于选择包含最后写入的数据的库的方法,通常称为活值表(LVT)。以前的大多数工作都集中在LVT的设计上,以减小面积并提高性能。在本文中,我们通过优化“数据库”部分的设计来减少面积。该优化嵌入解决了设置覆盖问题的内存编译器中。当设置覆盖问题得到最佳解决时,数据库将使用最小面积。我们的技术适用于具有结构模式(称为“交换端口”)的多端口RAM。交换端口是真实端口的概括,其中可以使用一个公共的读/写控制信号将一定数量的写端口动态切换为可能不同数量的读端口。此外,给定的应用程序可以具有多个集合,每个集合具有不同的读/写控件。尽管先前的工作生成了仅包含真实端口或仅包含简单端口的多端口RAM解决方案,但我们认为仅使用这两种模型过于局限,因而无法应用优化。在10个随机的多端口RAM实例上的实验结果表明,与其他最佳方法相比,BRAM平均减少了17%。编译器和完全参数化的Verilog实现作为开源库发布。该库已经使用Altera的EDA工具进行了广泛的测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号