首页> 外文学位 >Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains.
【24h】

Content-based retrieval of arbitrarily shaped video objects in the uncompressed and compressed domains.

机译:在未压缩和压缩域中基于内容的任意形状视频对象的检索。

获取原文
获取原文并翻译 | 示例

摘要

Advancements in video object segmentation technology and the availability of efficient object-based video representations, such as MPEG-4 [1], have resulted in the increased availability of arbitrarily shaped digital video content. While this enables many exciting applications, the process of locating and accessing a desired video sequence can still be challenging because of the large volume of data associated with even compressed video.; This dissertation proposes generic methods for the retrieval of arbitrarily shaped video objects in the MPEG-4 compressed domain, using their shape, local motion, and color content. Considering that a one-minute long video sequence may contain more than 1,500 frames, summarization of video content is necessary as a first step to efficiently retrieve video. Therefore, we first suggest a method for the summarization of arbitrarily shaped video objects. This is achieved by selecting the temporal instants of video objects—based on their compressed domain shape information—that efficiently represent the objects' salient content.; Next, we propose to extend some well-proven still shape retrieval techniques to retrieve video objects in the compressed domain. We compute the Fourier and ART (Angular Radial Transform) descriptors on the shape approximations obtained from the MPEG-4 shape coding modes. We also present a method to compute the shape distances between two video objects based on these still shape features.; Unlike in the case of still objects, one of the key features that describe a video object is motion. Classification of video objects by their local motion is addressed in this thesis by presenting three new motion descriptors. These descriptors are computed based on the shape deformations of arbitrarily shaped video, and assume no prior knowledge about the video content.; Color is one of the most widely used low level features in content-based retrieval. In this thesis, we also study efficient color content matching of arbitrarily shaped video, and in particular, color histogram computation in the MPEG-4 compressed domain.; Our experimental results demonstrate that our techniques enable effective and low complexity content-based retrieval. Employing MPEG-4 compressed domain information not only obviates the need for full decompression of the bit stream, hence yielding substantial computational savings, but also allows our techniques to be more robust to segmentation errors.
机译:视频对象分割技术的进步以及有效的基于对象的视频表示形式(例如MPEG-4 [1])的出现,导致了任意形状的数字视频内容的可用性增加。尽管这使许多激动人心的应用成为可能,但是定位和访问所需视频序列的过程仍然具有挑战性,因为与压缩视频关联的数据量也很大。本文提出了利用MPEG-4压缩域的形状,局部运动和颜色含量来检索任意形状的视频对象的通用方法。考虑到一分钟长的视频序列可能包含多于1,500帧,因此必须对视频内容进行汇总,以作为有效检索视频的第一步。因此,我们首先提出一种用于汇总任意形状视频对象的方法。这是通过基于视频对象的压缩域形状信息选择视频对象的瞬时来实现的,该瞬时实例有效地表示了对象的显着内容。接下来,我们建议扩展一些经过验证的静止形状检索技术,以在压缩域中检索视频对象。我们根据从MPEG-4形状编码模式获得的形状近似值来计算傅立叶和ART(角度径向变换)描述符。我们还提出了一种基于这些静止形状特征来计算两个视频对象之间的形状距离的方法。与静止对象不同,描述视频对象的关键特征之一是运动。本文通过提出三个新的运动描述符来解决视频对象按局部运动的分类问题。这些描述符是根据任意形状的视频的形状变形来计算的,并且不具有关于视频内容的先验知识。颜色是基于内容的检索中使用最广泛的低级功能之一。在本文中,我们还研究了任意形状视频的有效颜色内容匹配,尤其是MPEG-4压缩域中的颜色直方图计算。我们的实验结果表明,我们的技术可以实现有效且低复杂度的基于内容的检索。采用MPEG-4压缩域信息不仅消除了对比特流进行完全解压缩的需要,从而节省了大量计算量,而且还使我们的技术对分段错误更加健壮。

著录项

  • 作者

    Erol, Berna.;

  • 作者单位

    The University of British Columbia (Canada).;

  • 授予单位 The University of British Columbia (Canada).;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 169 p.
  • 总页数 169
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号