首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Highly Paralleled Low-Cost Embedded HEVC Video Encoder on TI KeyStone Multicore DSP
【24h】

Highly Paralleled Low-Cost Embedded HEVC Video Encoder on TI KeyStone Multicore DSP

机译:TI Keystone MultiCore DSP上高度平行的低成本嵌入式HEVC视频编码器

获取原文
获取原文并翻译 | 示例

摘要

Although HEVC, the emerging video coding standard, has doubled the coding performance of its predecessor H.264/AVC, its significantly increased computational complexity imposes great obstacles for HEVC encoders to be employed in real-time applications with embedded processors, such as digital signal processors (DSPs). In this paper, a TI Keystone multicore TMS320C6678 DSP-based highly paralleled low-cost fast HEVC encoding solution is well designed and implemented. First, the overall structure of HEVC encoder with CTU-level parallelism is re-designed to well support the encoding parallelism, with full consideration of the hardware characteristics. Second, a low-delay and low-memory multicore data transmission mechanism is proposed to reduce the latency of data access between internal L2 memory and external DDR3. Third, the encoding bottlenecks, i.e., the most time-consuming encoding modules, are identified and optimized for acceleration with TI powerful C6000 SIMD instructions. Experimental results show that our proposed HEVC encoder on TI TMS320C6678 DSPs can significantly improve the real-time capacity with tolerable performance loss, 0.93 dB performance loss under on average 465.50 times speedup as compared to CPU-based HM reference software, more specifically, which makes it desirable in power-constrained real-time video applications.
机译:虽然HEVC,新兴的视频编码标准翻了一番,但其前身H.264 / AVC的编码性能翻了一番,但其显着提高的计算复杂性对HEVC编码器施加了很大的障碍,以便在具有嵌入式处理器的实时应用中使用,例如数字信号处理器(DSP)。本文采用了基于TI Keystone MultiCore TMS320C6678 DSP的高度并行低成本快速HEVC编码解决方案。首先,具有CTU级并行性的HEVC编码器的整体结构被重新设计,以极好地支持编码并行性,充分考虑了硬件特性。其次,提出了低延迟和低存储器多核数据传输机制,以减少内部L2存储器和外部DDR3之间的数据访问的延迟。第三,识别编码瓶颈,即最耗时的编码模块,用于与TI强大的C6000 SIMD指令加速度。实验结果表明,我们提出的TI TMS320C6678 DSPS上提出的HEVC编码器可以显着提高实时容量,与基于CPU的HM参考软件相比,平均较为465.50倍的实时容量,0.93 dB性能损失。它希望在功率约束的实时视频应用中。

著录项

  • 来源
  • 作者单位

    Beihang Univ Beijing Key Lab Digital Media Sch Comp Sci & Engn Beijing 100191 Peoples R China|Beihang Univ State Key Lab Virtual Real Technol & Syst Beijing 100191 Peoples R China;

    Beihang Univ Beijing Key Lab Digital Media Sch Comp Sci & Engn Beijing 100191 Peoples R China|China Acad Elect & Informat Technol Beijing 100041 Peoples R China;

    Beihang Univ Beijing Key Lab Digital Media Sch Comp Sci & Engn Beijing 100191 Peoples R China|Beihang Univ State Key Lab Virtual Real Technol & Syst Beijing 100191 Peoples R China;

    Beihang Univ Beijing Key Lab Digital Media Sch Comp Sci & Engn Beijing 100191 Peoples R China|Beihang Univ State Key Lab Virtual Real Technol & Syst Beijing 100191 Peoples R China;

    Beihang Univ Beijing Key Lab Digital Media Sch Comp Sci & Engn Beijing 100191 Peoples R China|Shandong Univ Sci & Technol Shandong Prov Key Lab Wisdom Mine Informat Techno Qingdao 266590 Shandong Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    HEVC; TMS320C6678; SIMD; multicore DSP; embedded system;

    机译:HEVC;TMS320C6678;SIMD;多核DSP;嵌入式系统;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号