Clustering methods for content-based video analysis.

机译：基于内容的视频分析的聚类方法。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this thesis, we present a clustering-based framework for efficient, unsupervised video content analysis and management. The proposed algorithms implement the components of a complete visual content management system, and provide such key functionalities as navigation/browsing, visual summarization/compaction, color-based retrieval, and (pseudo-)semantic annotation. The group-of-frames video representation scheme and color descriptors that we introduce have become normative elements of the new multimedia standard MPEG-7, and the proposed analysis tools can be used to generate components of MPEG-7-compliant content descriptions.; We first introduce a crisp clustering algorithm to identify, in near real-time and without the need for threshold selection, the individual shots within a video sequence. Histogram-based descriptors are then defined for joint representation of the color features of a collection of frames. The family of alpha-trimmed average histograms provide a robust set of color descriptors that can eliminate the effects of aberrant frames within a shot or video segment. The intersection histogram, on the other hand, reflects the number of pixels of a given color that is common to all frames of interest.; These color descriptors are subsequently utilized as part of a fuzzy clustering algorithm to determine and extract the optimum, non-redundant set of representative frames for each segment in a sequence. The approach utilizes associated components of the MPEG-7 standard, and generates hierarchical video frame extraction process is largely unsupervised, and the resulting visual summary can be dynamically updated according to user preferences during a browsing session. In addition to these generic, domain-independent methods, a fuzzy classification technique is developed for unsupervised, domain-dependent video sequence analysis and shot labeling. While the algorithm requires pre-computed reference models that characterize each domain, these models are established (semi-)automatically during a training stage, again through the use of fuzzy clustering and cluster validation methods. A fuzzy nearest-neighbor/prototype classifier is then used to assign unlabeled video shots to the available classes. The fuzzy membership values associated with each shot allow erroneous classifications to be avoided when an input sample lacks strong affiliation with any of the categories.

机译：在本文中，我们提出了一种基于集群的框架，用于高效，无监督的视频内容分析和管理。所提出的算法实现了完整的视觉内容管理系统的组件，并提供了诸如导航/浏览，视觉摘要/压缩，基于颜色的检索以及（伪）语义注释之类的关键功能。我们介绍的帧组视频表示方案和颜色描述符已成为新的多媒体标准MPEG-7的规范元素，并且所提出的分析工具可用于生成符合MPEG-7的内容描述的组件。我们首先引入一种清晰的聚类算法，以几乎实时的方式识别视频序列中的各个镜头，而无需进行阈值选择。然后定义基于直方图的描述符，以联合表示帧集合的颜色特征。 alpha修剪的平均直方图系列提供了一组可靠的颜色描述符，可以消除镜头或视频片段中异常帧的影响。另一方面，相交直方图反映了所有感兴趣帧共有的给定颜色的像素数。这些颜色描述符随后被用作模糊聚类算法的一部分，以确定并提取序列中每个片段的代表帧的最佳，非冗余最佳集合。该方法利用了MPEG-7标准的相关组件，并且在很大程度上不受监督地生成分层视频帧提取过程，并且可以根据用户在浏览会话期间的偏好来动态更新所得的视觉摘要。除了这些通用的，与域无关的方法之外，还开发了一种模糊分类技术，用于无监督，与域相关的视频序列分析和镜头标记。虽然该算法需要表征每个域的预先计算的参考模型，但这些模型是在训练阶段自动（半）自动建立的，方法是再次使用模糊聚类和聚类验证方法。然后，使用模糊最近邻/原型分类器将未标记的视频镜头分配给可用的类。当输入样本缺乏与任何类别的强隶属关系时，与每个镜头相关的模糊隶属度值可以避免错误分类。

著录项

作者
Ferman, Ahmet Mufit.;
展开▼
作者单位

The University of Rochester.;

展开▼
授予单位 The University of Rochester.;
学科 Engineering Electronics and Electrical.
学位 Ph.D.
年度 2001
页码 101 p.
总页数 101
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Representative-Video-Frame Selection Method for a Content-Based Video-Query-Agent System [J] . Katsunobu FUSHIKIDA, Yoshitsugu HIWATRI, Hideyo WAKIT IEICE Transactions on Information and Systems . 2000,第6期

机译：基于内容的视频查询代理系统的代表性视频帧选择方法
2. An experimental comparison of clustering methods for content-based indexing of large image databases [J] . Hien Phuong Lai, Muriel Visani, Alain Boucher, Pattern Analysis and Applications . 2012,第4期

机译：基于内容的大型图像数据库索引聚类方法的实验比较
3. An Improved Method for Eliminating False Matches in Content-Based Video Copy Detection System [J] . Mingyang Liu, Shengsheng Wang, Bin Zhao Journal of computational and theoretical nanoscience . 2016,第3期

机译：一种用于消除基于内容的视频复制检测系统的假匹配的改进方法
4. New indexing method for content-based video retrieval and clustering for MPEG video database [C] . Zaher Aghbari, Kunihiko Kaneko, Akifumi Makinouchi International symposium on digital media information base . 1998

机译：基于内容的视频检索和MPEG视频数据库群集的新索引方法
5. Effective temporal video segmentation and content-based audio-visual video clustering. [D] . Kang, Jung Won. 2003

机译：有效的时间视频分割和基于内容的视听视频聚类。
6. HealthRecSys: A semantic content-based recommender system to complement health videos [O] . Carlos Luis Sanchez Bocanegra, Jose Luis Sevillano Ramos, Carlos Rizo, 2017

机译：HealthRecSys：基于语义内容的推荐系统可补充健康视频
7. Content-Based Indexing of Images and Video Using Face Detection and Recognition Methods [O] . Stefan Eickeler, Frank Wallhoff, Uri Iurgel, 2001

机译：使用面部检测和识别方法的基于内容的图像和视频索引

Clustering methods for content-based video analysis.

摘要

著录项

相似文献

相关主题

期刊订阅