首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Boundary macroblock padding in MPEG-4 video decoding using a graphics coprocessor
【24h】

Boundary macroblock padding in MPEG-4 video decoding using a graphics coprocessor

机译:使用图形协处理器的MPEG-4视频解码中的边界宏块填充

获取原文
获取原文并翻译 | 示例

摘要

MPEG-4 is the latest multimedia coding standard that supports object-based coding and manipulation of natural video and synthetic graphics objects. Due to its various features and high coding efficiency, MPEG-4 is becoming popular in video streaming applications. Many graphics coprocessors provide the acceleration of inverse discrete cosine transform (IDCT) and motion compensation for real-time video decoding. Therefore, it is desired to use the graphics coprocessors to accelerate MPEG-4 video decoding as well. Since MPEG-4 video decoding for rectangular video objects is similar to other video coding standards, e.g., MPEG-2, the IDCT and motion compensation can still be executed on the graphics coprocessors. However, we have found that boundary macroblock padding, which is an essential processing step in decoding arbitrarily shaped video objects, could not be efficiently accelerated on the graphics coprocessors due to its complexity. Although we can implement the boundary macroblock padding on the host processor, the frame data processed on the graphics coprocessor need to be transferred to the host processor for padding. In addition, the padded data on the host processor need to be sent back to the graphics coprocessor to be used as a reference for subsequent frames. To avoid this overhead, we present two approaches of boundary macroblock padding. In the first approach, the boundary macroblock padding is partitioned into two tasks, one of which the host processor can perform without the overhead of data transfers. In the second approach, we propose two new instructions and an algorithm that can be easily adopted in the next-generation graphics coprocessors or mediaprocessors, which gives a performance improvement of up to a factor of nine compared to that with the Pentium III.
机译:MPEG-4是最新的多媒体编码标准,它支持基于对象的编码以及自然视频和合成图形对象的处理。由于其各种功能和高编码效率,MPEG-4在视频流应用中变得越来越流行。许多图形协处理器为实时视频解码提供了逆离散余弦变换(IDCT)的加速和运动补偿。因此,期望也使用图形协处理器来加速MPEG-4视频解码。由于用于矩形视频对象的MPEG-4视频解码类似于其他视频编码标准,例如MPEG-2,因此IDCT和运动补偿仍然可以在图形协处理器上执行。但是,我们发现,边界宏块填充是解码任意形状的视频对象中必不可少的处理步骤,由于其复杂性,无法在图形协处理器上有效地加速。尽管我们可以在主机处理器上实现边界宏块填充,但需要将在图形协处理器上处理的帧数据传输到主机处理器以进行填充。另外,主机处理器上的填充数据需要发送回图形协处理器,以用作后续帧的参考。为了避免这种开销,我们提出了两种边界宏块填充方法。在第一种方法中,边界宏块填充被划分为两个任务,主机处理器可以执行其中的一项任务,而没有数据传输的开销。在第二种方法中,我们提出了两个新的指令和一种可以在下一代图形协处理器或媒体处理器中轻松采用的算法,与奔腾III相比,其性能提高了多达九倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号