首页> 外文学位 >Clustering methods for content-based video analysis.
【24h】

Clustering methods for content-based video analysis.

机译:基于内容的视频分析的聚类方法。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, we present a clustering-based framework for efficient, unsupervised video content analysis and management. The proposed algorithms implement the components of a complete visual content management system, and provide such key functionalities as navigation/browsing, visual summarization/compaction, color-based retrieval, and (pseudo-)semantic annotation. The group-of-frames video representation scheme and color descriptors that we introduce have become normative elements of the new multimedia standard MPEG-7, and the proposed analysis tools can be used to generate components of MPEG-7-compliant content descriptions.; We first introduce a crisp clustering algorithm to identify, in near real-time and without the need for threshold selection, the individual shots within a video sequence. Histogram-based descriptors are then defined for joint representation of the color features of a collection of frames. The family of alpha-trimmed average histograms provide a robust set of color descriptors that can eliminate the effects of aberrant frames within a shot or video segment. The intersection histogram, on the other hand, reflects the number of pixels of a given color that is common to all frames of interest.; These color descriptors are subsequently utilized as part of a fuzzy clustering algorithm to determine and extract the optimum, non-redundant set of representative frames for each segment in a sequence. The approach utilizes associated components of the MPEG-7 standard, and generates hierarchical video frame extraction process is largely unsupervised, and the resulting visual summary can be dynamically updated according to user preferences during a browsing session. In addition to these generic, domain-independent methods, a fuzzy classification technique is developed for unsupervised, domain-dependent video sequence analysis and shot labeling. While the algorithm requires pre-computed reference models that characterize each domain, these models are established (semi-)automatically during a training stage, again through the use of fuzzy clustering and cluster validation methods. A fuzzy nearest-neighbor/prototype classifier is then used to assign unlabeled video shots to the available classes. The fuzzy membership values associated with each shot allow erroneous classifications to be avoided when an input sample lacks strong affiliation with any of the categories.
机译:在本文中,我们提出了一种基于集群的框架,用于高效,无监督的视频内容分析和管理。所提出的算法实现了完整的视觉内容管理系统的组件,并提供了诸如导航/浏览,视觉摘要/压缩,基于颜色的检索以及(伪)语义注释之类的关键功能。我们介绍的帧组视频表示方案和颜色​​描述符已成为新的多媒体标准MPEG-7的规范元素,并且所提出的分析工具可用于生成符合MPEG-7的内容描述的组件。我们首先引入一种清晰的聚类算法,以几乎实时的方式识别视频序列中的各个镜头,而无需进行阈值选择。然后定义基于直方图的描述符,以联合表示帧集合的颜色特征。 alpha修剪的平均直方图系列提供了一组可靠的颜色描述符,可以消除镜头或视频片段中异常帧的影响。另一方面,相交直方图反映了所有感兴趣帧共有的给定颜色的像素数。这些颜色描述符随后被用作模糊聚类算法的一部分,以确定并提取序列中每个片段的代表帧的最佳,非冗余最佳集合。该方法利用了MPEG-7标准的相关组件,并且在很大程度上不受监督地生成分层视频帧提取过程,并且可以根据用户在浏览会话期间的偏好来动态更新所得的视觉摘要。除了这些通用的,与域无关的方法之外,还开发了一种模糊分类技术,用于无监督,与域相关的视频序列分析和镜头标记。虽然该算法需要表征每个域的预先计算的参考模型,但这些模型是在训练阶段自动(半)自动建立的,方法是再次使用模糊聚类和聚类验证方法。然后,使用模糊最近邻/原型分类器将未标记的视频镜头分配给可用的类。当输入样本缺乏与任何类别的强隶属关系时,与每个镜头相关的模糊隶属度值可以避免错误分类。

著录项

  • 作者

    Ferman, Ahmet Mufit.;

  • 作者单位

    The University of Rochester.;

  • 授予单位 The University of Rochester.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 101 p.
  • 总页数 101
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号