
Interframe estimation and video coding.


Abstract

This thesis consists of two parts: the first part addresses least-squares optimal backward and forward motion-compensated interframe estimation. The second part is devoted to the development of two applications of interframe estimation: backward multiresolution video coding and coding of MRI volumetric data.

We propose an operator-based paradigm for backward motion estimation. It is shown that the backward motion-compensated estimation problem, in a sampled environment, naturally decomposes into a discrete search and a continuous optimization problem. Solution strategies for each of the problems are individually explored, and a fast recursive least squares (RLS) algorithm for the solution of the optimization problem is proposed. We explore a dual operator space that offers potential computational advantages compared to the primary operators, and extend the operator method to higher dimensional spaces.

Two predominant motion estimation methods, known as warping and overlapped-block matching, are studied. We compute optimal interpolation kernels for warping, based on video sequence statistics. Optimal interpolation kernels for typical sequences are far from the usual bilinear or affine kernels, and offer significant improvements in MSE performance.

Warping and overlapped-block methods are very different in their approach to addressing motion field ambiguities (ambiguities arise from a subsampled motion field). Each of them is best suited to certain motion scenarios. Noting that these scenarios often coexist in video, we motivate a joint warping/overlapped-block methodology. Through a generalization of the techniques developed in optimal warping, optimal joint warping/overlapped-block kernels are computed.

The problem of backward coding in a multiresolution environment is analyzed. It is shown that a direct band-to-band estimation of coefficients in a maximally subsampled (wavelet) multiresolution framework is generally not possible, due to aliasing effects. A method is proposed to circumvent the aliasing problem, and a coding scheme is built around it. Quantization of estimation errors (residues) is performed through zerotree wavelet coding. Because of interdependencies in the quantization of estimation error at a given resolution and motion estimation at higher resolutions, "true" zerotrees are not computable, thus we construct a substitute zerotree. The resulting coder has a rate-distortion performance comparable to that provided by a typical MPEG coder.

Finally, interframe coding of volumetric MRI data is considered. For the first time, this work reports coding gain (over intraframe coding) for interframe coding of MRI data. The interframe estimator used in this coder addresses the volumetric nature of the data by using a warping estimator and the high-variance noise of MRI through in-loop filtering. Quantization of estimation errors is performed through a zerotree wavelet coder.
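
The aliasing remark about maximally subsampled wavelet bands can be illustrated with a small numerical sketch. The example below is not taken from the thesis: it assumes a one-level Haar filter bank on a 1-D signal, and the names `haar_level1`, `frame_a` and `frame_b` are hypothetical. It only shows the general effect that a one-sample shift of the input does not appear as a shift of the decimated high-band coefficients, which is why predicting a band directly from the same band of the previous frame tends to fail.

```python
import numpy as np

def haar_level1(x):
    """One level of a maximally subsampled (decimated) Haar transform.

    Illustrative assumption: Haar filters, even-length 1-D input.
    """
    x = np.asarray(x, dtype=float)
    even, odd = x[0::2], x[1::2]
    low = (even + odd) / np.sqrt(2.0)   # low-pass band, subsampled by 2
    high = (even - odd) / np.sqrt(2.0)  # high-pass band, subsampled by 2
    return low, high

# A step edge, and the same edge displaced by one sample
# (the first sample is repeated on the left to keep the length).
frame_a = np.array([0, 0, 0, 0, 1, 1, 1, 1], dtype=float)
frame_b = np.concatenate(([frame_a[0]], frame_a[:-1]))

low_a, high_a = haar_level1(frame_a)
low_b, high_b = haar_level1(frame_b)

# Look for any circular shift of the high band of frame_a
# that reproduces the high band of frame_b.
matches = [k for k in range(high_a.size)
           if np.allclose(np.roll(high_a, k), high_b)]

print("high band, frame_a:", high_a)
print("high band, frame_b:", high_b)
print("shifts of frame_a's band that match frame_b's band:", matches)
```

In this toy case the edge in `frame_a` falls on a pair boundary, so its high band is entirely zero, while the displaced edge in `frame_b` produces a nonzero coefficient. No shift of the first band can predict the second, even though the underlying motion is a pure one-sample translation.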
