首页> 外文学位 >Efficient coding techniques for high definition video.
【24h】

Efficient coding techniques for high definition video.

机译:高清晰度视频的高效编码技术。

获取原文
获取原文并翻译 | 示例

摘要

High definition (HD) video contents become popular and displays of higher resolution such as ultra definition are emerging in recent years. The conventional video coding standards offer excellent coding performance at lower bit-rates. However, their coding performance for HD video contents is not as efficient. The objective of this research is to develop a set of efficient coding tools or techniques to offer a better coding gain for HD video. The following three techniques are studied in this work.;First, we present a Joint first-order-residual/second-order residual (FOR/SOR) coding technique. The FOR/SOR algorithm that incorporates a few advanced coding techniques is proposed for HD video coding. For the FOR coder, the block-based prediction is used to exploit both temporal and spatial correlation in an original frame surface for coding efficiency. However, there still exists structural noise in the prediction residuals. We design an efficient SOR coder to encode the residual image. Block-adaptive bit allocation between the FOR and the SOR coders is developed to enhance the coding performance, which corresponds to selecting two different quantization parameters in the FOR and the SOR coders in different spatial regions. It is shown by experimental results that the proposed FOR/SOR coding algorithm outperforms H.264/AVC significantly in HD video coding with an averaged bit rate saving of 15.6%.;Second, we develop two advanced processing techniques, which are referred as to two layered transform with sparse representation (TTSR) and slant residual shift (SRS), for prediction residuals so as to improve coding efficiency. Prediction residues often show a nonstationary property, and the DCT becomes sub-optimal and yields undesired artifacts. The proposed TTSR algorithm makes use of sparse representation and is targeted toward the state-of-the-art video coding standard, High Efficiency Video Coding (HEVC), in this work. A dictionary is adaptively trained to contain featured patterns of residual signals so that a high portion of the energy in a structured residual can be efficiently coded with sparse coding. Then, the following DCT in cascade is applied to the remaining signal after spare coding. The use of multiple representations is justified with an R-D analysis, and the two transforms successfully complement each other. The SRS technique is to align dominant prediction residuals of inter-predicted frames with the horizontal or the vertical direction via row-wise or column-wise circular shift before the 2-D DCT. To determine the proper shift of pixels, we classify blocks into several types, each of which is assigned an index number. Then, these indices are sent to the decoder as signaling flags, which can be viewed as the mode information of the SRS technique. It is demonstrated by experimental results that the proposed algorithm outperforms the HEVC.;Third, we make a contribution to the HEVC with several efficient coding tools incorporated into the two context adaptive entropy coding, i.e., Context Adaptive Variable Length Coding (CAVLC) and Context Adaptive Binary Arithmetic Coding (CABAC). The proposed tableless VLC coding scheme removes all the tables used for the residual coding, yet yields negligible changes to the coding performance. Statistical property of a symbol is employed to replace the conventional tables with a mathematical model and improve a coding gain with a high order Markov model. On top of that, a context for a significance map coding in a large transform block is newly designed. The proposed context model removes a dependency of neighbor significant coefficients along the same scanning line, and, thus, it enhances a throughput of the CABAC. The proposed algorithm extends to the mode dependent coefficient scanning method for a large transform block. The proposed algorithm has negligible effect on the coding performance while it significantly improves the parallelization.
机译:近年来,高清(HD)视频内容变得越来越流行,并且诸如高分辨率的高分辨率显示正在兴起。常规的视频编码标准以较低的比特率提供了出色的编码性能。但是,它们对高清视频内容的编码性能不那么有效。这项研究的目的是开发一套有效的编码工具或技术,以为高清视频提供更好的编码增益。在这项工作中,研究了以下三种技术:首先,我们提出一种联合一阶残差/二阶残差(FOR / SOR)编码技术。针对高清视频编码,提出了结合了一些高级编码技术的FOR / SOR算法。对于FOR编码器,使用基于块的预测来利用原始帧表面中的时间和空间相关性以提高编码效率。但是,预测残差中仍然存在结构噪声。我们设计了一种有效的SOR编码器来对残差图像进行编码。开发了FOR和SOR编码器之间的块自适应位分配以增强编码性能,这对应于在不同空间区域中在FOR和SOR编码器中选择两个不同的量化参数。实验结果表明,所提出的FOR / SOR编码算法在高清视频编码中明显优于H.264 / AVC,平均比特率节省15.6%。其次,我们开发了两种先进的处理技术,分别是采用稀疏表示(TTSR)和倾斜残差移位(SRS)的两层变换来预测残差,从而提高编码效率。预测残差通常显示出不稳定的性质,并且DCT变得次优,并产生了不希望的伪像。这项拟议的TTSR算法利用稀疏表示,并且针对最新的视频编码标准高效视频编码(HEVC)。对字典进行自适应训练,使其包含残差信号的特征模式,以便可以用稀疏编码有效地编码结构化残差中的大部分能量。然后,级联的后续DCT被应用于备用编码之后的剩余信号。通过R-D分析证明使用多种表示形式是合理的,并且两个变换成功地互补。 SRS技术是在二维DCT之前通过行或列的圆移位将帧间预测帧的主要预测残差与水平或垂直方向对准。为了确定像素的正确偏移,我们将块分为几种类型,每种类型都分配了一个索引号。然后,将这些索引作为信令标记发送到解码器,可以将其视为SRS技术的模式信息。实验结果表明,所提出的算法优于HEVC。第三,结合两种上下文自适应熵编码(CAVLC)和Context自适应熵编码,为HEVC做出了贡献。自适应二进制算术编码(CABAC)。所提出的无表VLC编码方案删除了​​用于残差编码的所有表,但对编码性能的影响却微不足道。利用符号的统计特性来用数学模型代替常规表格,并通过高阶马尔可夫模型提高编码增益。最重要的是,新设计了用于大变换块中的有效图编码的上下文。所提出的上下文模型消除了沿着同一条扫描线的相邻有效系数的依赖性,从而提高了CABAC的吞吐量。提出的算法扩展到大变换块的模式相关系数扫描方法。所提出的算法对编码性能的影响可以忽略不计,同时可以显着提高并行度。

著录项

  • 作者

    Kang, Je-Won.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 119 p.
  • 总页数 119
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:42:49

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号