IEEE Custom Integrated Circuits Conference

An energy-efficient coarse-grained dynamically reconfigurable fabric for multiple-standard video decoding applications

Abstract

In this paper, we introduce a coarse-grained dynamically reconfigurable fabric, named the Reconfigurable Processing Unit (RPU), implemented on 5.4×3.1 mm² of silicon in TSMC 65 nm LP1P8M technology. The fabric consists of 16×16 multi-functional Processing Elements (PEs) interconnected by area-efficient Line-Switched Mesh Connect (LSMC) routing. A Hierarchical Configuration Context (HCC) organization scheme is proposed to reduce the size of the context memory and improve configuration efficiency. Two reconfigurable processors were designed and fabricated to verify the proposed techniques. The first, REMUS_HPP, integrates two RPUs and targets high-performance applications. It decodes 1920×1080@30fps H.264 streams at 280 mW and 200 MHz, achieving a 1.81x performance gain and a 14.3x energy-efficiency improvement over XPP-III. The second, REMUS_LPP, integrates a single RPU and targets low-power applications. It decodes 720×480@35fps H.264 streams at 24.81 mW and 75 MHz, achieving a 76% power reduction and a 3.96x energy-efficiency improvement compared with ADRES. The RPU is not limited to video decoding: it can also process other computation-intensive applications, and the corresponding analysis is given in this paper as well.
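To put the two operating points quoted above on a common footing, the following sketch derives a few back-of-the-envelope quantities from them: energy per decoded frame, energy per pixel, clock cycles available per frame, and PE count. These derived figures do not appear in the abstract. They assume the quoted power is the average power while decoding at the stated frame rate, and they are not the energy-efficiency metric behind the XPP-III and ADRES comparisons, which the abstract does not define. The `OperatingPoint` class and its method names are hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatingPoint:
    """One decoder operating point, using only figures quoted in the abstract."""
    name: str
    width: int        # frame width in pixels
    height: int       # frame height in pixels
    fps: float        # decoded frames per second
    power_mw: float   # power in milliwatts (assumed to be the average)
    clock_mhz: float  # clock frequency in MHz
    rpus: int         # number of RPUs integrated

    def energy_per_frame_mj(self) -> float:
        # mW is mJ/s, so dividing by frames/s gives mJ per frame.
        return self.power_mw / self.fps

    def energy_per_pixel_nj(self) -> float:
        # 1 mJ = 1e6 nJ; spread the frame energy over all pixels in a frame.
        return self.energy_per_frame_mj() * 1e6 / (self.width * self.height)

    def cycles_per_frame(self) -> float:
        # Clock cycles available to decode one frame in real time.
        return self.clock_mhz * 1e6 / self.fps

    def pe_count(self) -> int:
        # Each RPU is a 16x16 array of PEs, per the abstract.
        return self.rpus * 16 * 16


HPP = OperatingPoint("REMUS_HPP", 1920, 1080, 30, 280.0, 200.0, rpus=2)
LPP = OperatingPoint("REMUS_LPP", 720, 480, 35, 24.81, 75.0, rpus=1)

if __name__ == "__main__":
    for p in (HPP, LPP):
        print(
            f"{p.name}: {p.pe_count()} PEs, "
            f"{p.energy_per_frame_mj():.3f} mJ/frame, "
            f"{p.energy_per_pixel_nj():.3f} nJ/pixel, "
            f"{p.cycles_per_frame():,.0f} cycles/frame"
        )
```

Under these assumptions REMUS_HPP spends roughly 9.3 mJ per 1080p frame and REMUS_LPP roughly 0.71 mJ per 720×480 frame. Because the resolutions differ, the per-pixel figure is the fairer point of comparison between the two chips.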