首页> 外文学位 >Dependent R-D modeling for H.264/SVC bit allocation.
【24h】

Dependent R-D modeling for H.264/SVC bit allocation.

机译:H.264 / SVC位分配的相关R-D建模。

获取原文
获取原文并翻译 | 示例

摘要

In this research, we investigate model-based bit allocation algorithms for H.264/SVC, which is newly standardized as a scalable extension of H.264/AVC. Despite its importance in video coding, inter-dependency between coding units is often indirectly addressed in conventional single layer bit allocation algorithms. This simplified treatment is adopted due to the complexity involved in its explicit consideration. In H.264/SVC, inter-dependency between coding units becomes even more involved and the development of an optimal bit allocation algorithm imposes an even higher challenge.;To address the bit allocation problem for H.264/SVC, we study dependent rate and distortion (R-D) models for temporal and quality scalabilities of H.264/SVC. Traditional R-D models represent rate and distortion characteristics as functions of quantization parameter (QP), known as the Q-domain analysis, or the percentage of zero transform coefficients, known as the rho-domain analysis. Unlike previous models, we examine dependent R-D characteristics based on the self-domain (S-domain) analysis (namely, R- and D-domain analysis for the rate and the distortion characteristics, respectively), where the R-D characteristics of a dependent coding unit are expressed as the R-D characteristics of its reference and/or base layers. With the proposed R-D models, the complex dependent R-D characteristics can be simplified to be a linear sum of functions of a single parameter. As a result, the bit allocation problem can be solved elegantly.;The temporal dependency of the video signal is first studied. According to the self D-domain study, the distortion of a coding unit is subject to temporal dependency. In contrast, its rate is relatively independent of its references. This leads to a linear dependent distortion model for the temporal scalability. Then, the research is extended to the quality scalability. Contrary to the temporal-layer case, the self R-domain analysis reveals inter-dependency of the rate characteristics. On the other hand, its distortion is independent of those in the preceding layers. For this reason, the quality-layer dependent rate is modeled as the linear sum of base layer rate functions. The performance of the proposed dependent R-D models is verified by comparing their R-D estimation results with actual R-D data. After the initial study on the dependent R-D models, we analyzed the proposed R-D models to understand the physical meaning of the model parameters. We could learn from the analysis that the model parameters convey important information about the inter-layer dependence in the T-Q scalability.;We conduct studies on two bit allocation algorithms, which are formulated as the Lagrangian optimization problem. First, we investigate a temporal layer bit allocation problem based on its dependent distortion model. Then, we examine the joint quality-temporal layer bit allocation problem by combining temporal and quality layer R-D models. One important advantage of the proposed R-D models is that they allow an analytical solution to the Lagrange equation. With the proposed algorithms, both bit allocation problems are numerically solved at significantly reduced complexity. It is shown by experimental results that the proposed algorithms could produce more efficient scalable bit streams than those by the H.264/SVC reference software codec (JSVM) at various bit rates with different types of test sequences. Moreover, the coding gain of each scalable layer, i.e., T or Q layer, exceeds that obtained by the JSVM benchmark.;The purpose of the R-D characteristics modeling is to develop an efficient and effective rate control algorithm for a video coder. Even though the bit allocation algorithms introduced in the first part of the thesis achieve better performance than the JSVM benchmark, their usage is limited to off-line encoding scenarios. To address the issue, we also conduct study the simplification of the bit allocation algorithms considering real-time encoding scenarios. The performance of the proposed is verified by experimental results in comparison with the JSVM benchmark.;Finally, we also examine the rate control of compressed scalable video under network based video application scenarios. We employ cross-layer approach to the design of a wireless video streaming algorithm. Beginning with a through review on the cross-layer design principles, we identified major challenges and issues with the cross-layer design approach in the realistic application scenarios and they could be well addressed by the proposed algorithm. The computer simulation results verifies the performance of proposed algorithm in comparison with single layer video streaming.
机译:在这项研究中,我们研究了H.264 / SVC的基于模型的位分配算法,该算法新近标准化为H.264 / AVC的可扩展性。尽管其在视频编码中的重要性,但是在常规的单层比特分配算法中常常间接地解决了编码单元之间的相互依赖性。由于其明确考虑的复杂性,因此采用了这种简化的处理方法。在H.264 / SVC中,编码单元之间的相互依赖性变得更加复杂,并且最优比特分配算法的发展提出了更高的挑战。;为解决H.264 / SVC的比特分配问题,我们研究了相关速率H.264 / SVC的时间和质量可伸缩性的失真和失真(RD)模型。传统的R-D模型将速率和失真特性表示为量化参数(QP)的函数,称为Q域分析,或者将零变换系数的百分比表示为rho域分析。与以前的模型不同,我们基于自域(S域)分析(即分别针对速率和失真特性的R域和D域分析)检查依赖的RD特性,其中依赖编码的RD特性单位表示为其参考层和/或基础层的RD特性。使用提出的R-D模型,可以将复杂的依赖R-D特性简化为单个参数函数的线性和。结果,可以很好地解决比特分配问题。;首先研究视频信号的时间依赖性。根据自D域研究,编码单元的失真受到时间依赖性的影响。相反,其速率相对独立于参考。这导致时间可伸缩性的线性相关失真模型。然后,将研究扩展到质量可伸缩性。与时间层情况相反,自R域分析揭示了速率特征的相互依赖性。另一方面,它的失真与前面几层的失真无关。因此,质量层相关速率被建模为基础层速率函数的线性和。通过将其相关的R-D估计结果与实际的R-D数据进行比较,可以验证所提出的相关R-D模型的性能。在对相关R-D模型进行初步研究之后,我们分析了所提出的R-D模型以了解模型参数的物理含义。我们可以从分析中得知,模型参数传达了有关T-Q可伸缩性中层间相关性的重要信息。我们对两个比特分配算法进行了研究,这些算法被公式化为拉格朗日优化问题。首先,我们研究基于其相关失真模型的时间层比特分配问题。然后,我们通过结合时间和质量层R-D模型来研究联合质量-时间层比特分配问题。所提出的R-D模型的一个重要优点是,它们允许对Lagrange方程进行解析。利用所提出的算法,在数值上以显着降低的复杂度解决了两个比特分配问题。实验结果表明,与不同类型的测试序列在各种比特率下,H.264 / SVC参考软件编解码器(JSVM)相比,所提出的算法可产生更有效的可扩展比特流。此外,每个可扩展层(即T或Q层)的编码增益都超过了JSVM基准测试所获得的增益。R-D特性建模的目的是为视频编码器开发一种有效的速率控制算法。尽管本文第一部分介绍的位分配算法比JSVM基准测试具有更好的性能,但它们的使用仅限于离线编码方案。为了解决该问题,我们还考虑了实时编码方案,对比特分配算法进行了简化研究。通过与JSVM基准测试的比较,实验结果验证了该方法的性能。最后,我们还研究了基于网络的视频应用场景下压缩可伸缩视频的速率控制。我们采用跨层方法来设计无线视频流算法。通过对跨层设计原则的全面回顾,我们发现了在实际应用场景中跨层设计方法的主要挑战和问题,并且所提出的算法可以很好地解决这些问题。与单层视频流相比,计算机仿真结果验证了所提算法的性能。

著录项

  • 作者

    Cho, Yongjin.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Engineering Electronics and Electrical.;Multimedia Communications.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 185 p.
  • 总页数 185
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号