首页> 外文学位 >Spatial and temporal prediction schemes for object-based digital video.
【24h】

Spatial and temporal prediction schemes for object-based digital video.

机译:基于对象的数字视频的时空预测方案。

获取原文
获取原文并翻译 | 示例

摘要

Object-based video compression schemes have recently gained tremendous interest, and are the driving force of the future MPEG 4 and 7 video coding standards. These object-based coders not only provide for a higher compression efficiency, but they can also support content-based functionalities for future multimedia applications. In this thesis, spatial and temporal prediction techniques are investigated for these newer object-based compression schemes. Structural and statistical models are first investigated for spatially interpolating digital images for editing/compositing purposes in object-based video or for frame-based compression systems. Nonlinear filter structures based on vector order statistics are selected for spatially interpolating color images due to their robustness, preservation of edge information and image details, and their ability to exploit the interchannel color correlations. Statistical methods based on Gibbs random field (GRF) models are also used to obtain an interpolated image. The iterative GRF methods are approximated by a non-iterative nonlinear filtering operation, thereby reducing the computational complexity of the process. The proposed methods are all compared to the conventional linear approaches. Temporal prediction of videophone-type sequences is subsequently investigated in view of the problems associated with conventional block-based methods. Segmentation-based motion compensated prediction is carried out by partitioning an image into a facial region and a set of arbitrarily-shaped regions. A novel and useful approach is presented to automatically locate and track the facial area within image sequences using the visual cues of color and shape. The scheme is robust in scenes with head-and-shoulders type images found in videophone sequences or facial image databases, and can be easily tuned for more general cases where many faces may be present in the scene. A general color segmentation technique is then proposed to partition the remainder of the image and form a triangular mesh model. This is subsequently followed by a suitable mesh tracking scheme. Motion compensated prediction is finally performed by utilizing the estimated nodal point motion vectors and an affine warping transformation. Experimental results provide a comparison between the proposed scheme and the conventional block-based methods.
机译:基于对象的视频压缩方案最近引起了极大的兴趣,并且是未来MPEG 4和7视频编码标准的驱动力。这些基于对象的编码器不仅提供了更高的压缩效率,而且还可以为未来的多媒体应用程序支持基于内容的功能。本文针对这些较新的基于对象的压缩方案研究了空间和时间预测技术。首先研究结构模型和统计模型,以便在空间上内插数字图像,以便在基于对象的视频或基于帧的压缩系统中进行编辑/合成。由于其鲁棒性,边缘信息和图像细节的保留以及它们利用通道间颜色相关性的能力,因此选择了基于矢量阶数统计的非线性滤波器结构以用于空间内插彩色图像。基于吉布斯随机场(GRF)模型的统计方法也用于获得插值图像。迭代GRF方法通过非迭代非线性滤波操作近似,从而降低了过程的计算复杂性。所提出的方法都与常规线性方法进行了比较。鉴于与常规基于块的方法相关的问题,随后研究了可视电话类型序列的时间预测。通过将图像划分为面部区域和一组任意形状的区域来执行基于分段的运动补偿预测。提出了一种新颖而有用的方法,可以使用颜色和形状的视觉提示自动定位和跟踪图像序列中的面部区域。该方案在具有可视电话序列或面部图像数据库中的头肩型图像的场景中是可靠的,并且对于场景中可能存在许多面孔的更一般的情况,可以轻松进行调整。然后提出了一种通用的颜色分割技术来分割图像的其余部分,并形成一个三角形网格模型。随后是合适的网格跟踪方案。最后,通过利用估计的节点运动矢量和仿射变形变换来执行运动补偿预测。实验结果提供了所提出的方案与传统的基于块的方法之间的比较。

著录项

  • 作者

    Herodotou, Nicos.;

  • 作者单位

    University of Toronto (Canada).;

  • 授予单位 University of Toronto (Canada).;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 1998
  • 页码 209 p.
  • 总页数 209
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号