首页> 外文期刊>IEEE Transactions on Computers >An Integrated Memory Array Processor for Embedded Image Recognition Systems
【24h】

An Integrated Memory Array Processor for Embedded Image Recognition Systems

机译:用于嵌入式图像识别系统的集成存储器阵列处理器

获取原文
获取原文并翻译 | 示例

摘要

Embedded processors for video image recognition in most cases not only need to address the conventional cost (die size and power) versus real-time performance issue, but must also maintain high flexibility due to the immense diversity of recognition targets, situations, and applications. This paper describes IMAP, a highly parallel SIMD linear processor and memory array architecture that addresses these trade-off requirements. By using parallel and systolic algorithmic techniques, but based on a simple linear array architecture, IMAP successfully exploits not only the straightforward per-image row data level parallelism (DLP), but also the inherent DLP of other memory access patterns frequently found in various image recognition tasks, while allowing programming to be done using an explicit parallel C language (1DC). We describe and evaluate IMAP-CE, one of the latest IMAP processors, integrating 128 100 MHz 8 bit 4-way VLIW PEs, 128 2 KByte RAMs, and one 16 bit RISC control processor onto a single chip. The PE instruction set is enhanced to support 1DC code. The die size of IMAP-CE is 11 times 11 mm^{2} integrating 32.7 M transistors, while the power consumption is, on average, approximately 2 watts. IMAP-CE is evaluated mainly by comparing its performance while running 1DC code with that of a 2.4 GHz Intel P4 running optimized C code. Based on the use of parallelizing techniques, benchmark results show a speed increase of up to 20 times for image filter kernels and of 4 times for a full image recognition application.
机译:在大多数情况下,用于视频图像识别的嵌入式处理器不仅需要解决常规成本(芯片尺寸和功耗)与实时性能问题,而且由于识别目标,情况和应用的多样性,还必须保持高度的灵活性。本文介绍了IMAP,这是一种高度并行的SIMD线性处理器和存储器阵列架构,可满足这些折衷要求。通过使用并行和收缩算法技术,但基于简单的线性阵列体系结构,IMAP成功地利用了不仅简单的每图像行数据级并行性(DLP),而且还利用了各种图像中经常发现的其他内存访问模式的固有DLP识别任务,同时允许使用显式并行C语言(1DC)完成编程。我们描述并评估了IMAP-CE,这是最新的IMAP处理器之一,它在单个芯片上集成了128 100 MHz 8位4路VLIW PE,128个2 KB RAM和一个16位RISC控制处理器。 PE指令集得到增强,以支持1DC代码。 IMAP-CE的芯片尺寸是11毫米^ {2}的11倍,集成了3270万个晶体管,而功耗平均约为2瓦。 IMAP-CE的评估主要是通过比较运行1DC代码时的性能与运行优化的C代码的2.4 GHz Intel P4时的性能进行比较。基于并行化技术的使用,基准测试结果显示图像过滤器内核的速度提高了20倍,而完整图像识别应用程序的速度提高了4倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号