【24h】

Multiscale Content Extraction and Representation for Video Indexing

机译:视频索引的多尺度内容提取与表示

获取原文

摘要

This paper presents a general multiscale framework for extraction and representation of video content. The approach exploits the inherent multiscale nature of many TV and film productions to delineate an input stream effectively and to construct consistent scenes reliably. The method first utilizes basic signal processing techniques (i.e, temporal sampling, local windowing, mean and median filtering), and unsupervised clustering to determine shot boundaries in the video sequence. Similarity comparison using shot representative histograms and clustering is then carried out within each shot to automatically select representative key frames. Finally, a model that takes into account the filmic structure of the input stream is discussed and developed to efficiently merge individual shots into coherent, meaningful segments, i.e. scenes.
机译:本文介绍了一种用于提取和表示视频内容的一般多尺度框架。该方法利用许多电视和电影制作的固有的多尺度性质,以有效地描绘输入流,并可靠地构造一致的场景。该方法首先利用基本信号处理技术(即,时间采样,本地窗口,均值和中值滤波),以及无监督的聚类来确定视频序列中的截图边界。然后,使用拍摄代表直方图和聚类的相似性比较然后在每个镜头内执行以自动选择代表性的关键帧。最后,讨论并开发了一种考虑到输入流的胶片结构的模型,以便有效地将单个射击合并成连贯,有意义的段,即场景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号