...
首页> 外文期刊>Circuits and Systems for Video Technology, IEEE Transactions on >A Reconfigurable Heterogeneous Multimedia Processor for IC-Stacking on Si-Interposer
【24h】

A Reconfigurable Heterogeneous Multimedia Processor for IC-Stacking on Si-Interposer

机译:用于Si-Interposer上IC堆叠的可重构异构媒体处理器

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a heterogeneous multimedia processor for embedded media applications such as image processing, vision, 3-D graphics and augmented reality (AR), assuming integrated circuit (IC)-stacking on Si-interposer. This processor embeds reconfigurable output drivers for external memory interface to increase memory bandwidth even in a mobile environment. The implemented output driver reconfigures its driving strength according to channel loss between the implemented processor and the memory, so it enables highspeed data communication while achieving 8× higher memory bandwidth compared to previous embedded media processors. The implemented processor includes three main programmable intellectual properties, mode-configurable vector processing units (MCVPUs), a unified filtering unit (UFU), and a unified shader. MCVPUs have 32 integer (16 bit) cores in order to support dual-mode operations between image-level processing and graphics processing. This mode-configuration enables a frame-level pipelining in AR application, so the proposed processor achieves 1.7× higher frame rate compared to the sequential AR processing. UFU supports 16 types of filtering operations only with a single instruction. Most image-level processing consists of various types of filtering operations, so UFU can improve media processing performance and energy-efficiency. UFU also supports texture filtering which is performance bottleneck of common graphics pipeline. A memory-access-efficient (off-chip memory) texturing algorithm named as an adaptive block selection is proposed to enhance texturing performance in 3-D graphics pipeline. UFU has two-level on-chip memory hierarchies, a 512B level-0 (L0) data buffer, and an 8kB level-1 (L1) static random-access memory (SRAM) cache. The small-sized L0 data buffer limits direct references to the large-sized L1 SRAM cache to reduce energy consumed in on-chip memories. Unified shader consists of four homogeneous scalar processing ele- ents (SPEs) for geometry operations in 3-D graphics. Each SPE has single-precision floating-point data-paths, since precision of geometry operations in 3-D graphics is important in today''s handheld devices (high resolution). The proposed media processor is fabricated in 0.13 μm CMOS technology with 4 mm × 4 mm chip size, and dissipates 275 mW for full AR operation.
机译:本文提出了一种用于嵌入式媒体应用的异构多媒体处理器,例如图像处理,视觉,3-D图形和增强现实(AR),并假设在Si-interposer上堆叠了集成电路(IC)。该处理器嵌入用于外部存储器接口的可重新配置输出驱动器,即使在移动环境中也可以增加存储器带宽。所实现的输出驱动器根据所实现的处理器与内存之间的通道损耗重新配置其驱动强度,因此与以前的嵌入式媒体处理器相比,它实现了高速数据通信,同时实现了8倍的高存储带宽。所实现的处理器包括三个主要的可编程知识产权,模式可配置矢量处理单元(MCVPU),统一过滤单元(UFU)和统一着色器。 MCVPU具有32个整数(16位)内核,以支持图像级处理和图形处理之间的双模式操作。这种模式配置可在AR应用中实现帧级流水线化,因此与顺序AR处理相比,该处理器可实现1.7倍更高的帧速率。 UFU仅使用一条指令即可支持16种类型的过滤操作。大多数图像级处理都包含各种类型的过滤操作,因此UFU可以提高媒体处理性能和能源效率。 UFU还支持纹理过滤,这是常见图形管线的性能瓶颈。提出了一种称为自适应块选择的内存访问有效(片外内存)纹理化算法,以提高3-D图形管线中的纹理化性能。 UFU具有两级片上存储器层次结构,一个512B级别0(L0)数据缓冲区和一个8kB级别1(L1)静态随机存取存储器(SRAM)缓存。小型L0数据缓冲区限制了对大型L1 SRAM高速缓存的直接引用,以减少片上存储器的能耗。统一着色器包含四个均质标量处理元素(SPE),用于3D图形中的几何运算。每个SPE具有单精度浮点数据路径,因为在当今的手持设备(高分辨率)中3D图形中几何运算的精度非常重要。拟议的媒体处理器采用0.13μmCMOS技术制造,芯片尺寸为4 mm×4 mm,耗散275 mW的功率以实现完全AR操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号