首页> 外文期刊>IEEE Transactions on Image Processing >Multiscale modeling and estimation of motion fields for video coding
【24h】

Multiscale modeling and estimation of motion fields for video coding

机译:视频编码的运动场多尺度建模和估计

获取原文
获取原文并翻译 | 示例
       

摘要

We present a systematic approach to forward-motion-compensated predictive video coding. The first step is the definition of a flexible model that compactly represents motion fields. The inhomogeneity and spatial coherence properties of motion fields are captured using linear multiscale models. One possible design is based on linear finite elements and yields a multiscale extension of the triangle motion compensation (TMC) method. The second step is the choice of a computational technique that identifies the coefficients of the linear model. We study a modified optical flow technique and minimize a cost function closely related to Horn and Schunck's (1981) criterion. The cost function balances accuracy and complexity of the motion compensated predictor and is viewed as a measure of goodness of the motion field. It determines not only the coefficients of the model, but also the quantization method. We formulate the estimation and quantization problems jointly as a discrete optimization problem and solve it using a fast multiscale relaxation algorithm. A hierarchical extension of the algorithm allows proper handling of large displacements. Simulations on a variety of video sequences have produced improvements over TMC and over the half-pel-accuracy, full-search block matching algorithm, in excess of 0.5 dB in average. The results are visually superior as well. In particular, the reconstructed video is entirely free of blocking artifacts.
机译:我们提出了一种前向运动补偿的预测视频编码的系统方法。第一步是定义一个紧凑表示运动场的灵活模型。使用线性多尺度模型可以捕获运动场的不均匀性和空间相干性。一种可能的设计基于线性有限元,并且产生了三角运动补偿(TMC)方法的多尺度扩展。第二步是选择一种识别线性模型系数的计算技术。我们研究了一种改进的光流技术,并将与Horn和Schunck(1981)准则密切相关的成本函数最小化。代价函数平衡了运动补偿预测器的准确性和复杂性,并被视为运动场优劣的度量。它不仅确定模型的系数,而且确定量化方法。我们将估计和量化问题共同表述为离散优化问题,并使用快速多尺度松弛算法对其进行求解。该算法的分层扩展允许正确处理大位移。在各种视频序列上进行的仿真已对TMC和半像素精度的全搜索块匹配算法进行了改进,平均改进了0.5 dB以上。结果在视觉上也很出色。特别地,重建的视频完全没有阻塞伪像。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号