
Advanced motion modeling for three-dimensional video coding.


Abstract

Driven by new multimedia applications and the growing demand for more flexible and efficient transmission of video, a new approach to video coding has recently been proposed as an alternative to classical hybrid schemes. Instead of sequential frame-based predictive processing, the new approach is based on spatio-temporal 3D transforms, open-loop non-predictive processing, and embedded quantization and coding. This thesis investigates motion modeling for this new coding environment, as well as the impact of such modeling on both coder design and performance.

The first aspect of this thesis deals with video coding based on the 3D discrete cosine transform (DCT). We analyze the 3D DCT spectrum properties of a globally translating image and show how to use its characteristic footprint for fast and efficient video coding. Previous approaches to 3D DCT video coding have led to rather modest compression gains due to a limited use of motion characteristics in the transform domain. We develop a coefficient scanning order that adapts to motion, unlike the fixed zig-zag scanning of JPEG. We combine this adaptive scanning with a new 3D quantization model to design a low-complexity 3D DCT video coder. The new coder consistently outperforms MPEG-2 both subjectively and objectively (by more than 1.5 dB) at about 25% reduced complexity, while approaching the performance of MPEG-4 (within 0.8 dB) at less than half the computational complexity.

The second aspect of this thesis involves the role of motion in emerging video coders based on the 3D discrete wavelet transform (DWT) and motion-compensated temporal filtering (MCTF). Motion invertibility, central to the optimality of the lifted MCTF implementation, is investigated first. We introduce a metric for the invertibility error between two motion fields. We develop advanced motion inversion methods and demonstrate their effectiveness in improving the update lifting step. Experimental results confirm that better motion inversion, quantified by a lower invertibility error, leads to an increase in coding gain of up to 0.5 dB over simpler inversion techniques. We propose a new method for occlusion-aware modeling and estimation of motion fields and use it to create an adaptive 3D DWT coding structure. Implicit modeling of occluded/uncovered areas, combined with the use of longer wavelet kernels, improves both the prediction and update lifting steps and results in an overall compression gain of up to 1 dB over a non-adaptive coder. (Abstract shortened by UMI.)
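To make the prediction and update lifting steps mentioned above concrete, here is a minimal sketch of one level of Haar-based lifted MCTF on a pair of frames. It is an illustration under strong assumptions, not the method of the thesis: motion is modeled as a single global circular horizontal shift `d` (so the motion field is exactly invertible), and the names `shift`, `mctf_forward`, and `mctf_inverse` are hypothetical. The thesis itself concerns dense motion fields, occlusions, and longer wavelet kernels, none of which appear here.

```python
def shift(frame, d):
    """Circularly shift every row of `frame` right by d pixels (toy global motion)."""
    width = len(frame[0])
    return [[row[(x - d) % width] for x in range(width)] for row in frame]


def mctf_forward(a, b, d):
    """One Haar lifting level; `a` is the even frame, `b` the odd frame,
    and `d` the assumed global shift that maps `a` onto `b`."""
    # Prediction step: high band = b minus the motion-compensated a.
    pred = shift(a, d)
    high = [[bv - pv for bv, pv in zip(b_row, p_row)]
            for b_row, p_row in zip(b, pred)]
    # Update step: map the residual back onto a's grid with the inverse motion.
    back = shift(high, -d)
    low = [[av + 0.5 * hv for av, hv in zip(a_row, h_row)]
           for a_row, h_row in zip(a, back)]
    return low, high


def mctf_inverse(low, high, d):
    """Undo the lifting steps in reverse order to recover (a, b)."""
    back = shift(high, -d)
    a = [[lv - 0.5 * hv for lv, hv in zip(l_row, h_row)]
         for l_row, h_row in zip(low, back)]
    pred = shift(a, d)
    b = [[hv + pv for hv, pv in zip(h_row, p_row)]
         for h_row, p_row in zip(high, pred)]
    return a, b


if __name__ == "__main__":
    a = [[1, 2, 3, 4], [5, 6, 7, 8]]
    b = shift(a, 1)                      # b is exactly a translated by one pixel
    low, high = mctf_forward(a, b, 1)
    print(high)                          # residual vanishes when motion is exact
    print(mctf_inverse(low, high, 1) == (a, b))
```

In this sketch the lifting structure reconstructs both frames exactly for any choice of `d`, because the inverse simply reverses each step. What the choice of motion affects is how much energy is left in the high band and how well the update step aligns that residual with the even frame, which is where the invertibility of the motion field enters.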
