首页> 外文会议>International workshops on interactive distributed multimedia systems >Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries
【24h】

Accuracy vs. Speed Trade-Off in Detecting of Shots in Video Content for Abstracting Digital Video Libraries

机译:在抽象数字视频库中检测视频内容中镜头的精度与​​速度折衷

获取原文

摘要

Two basic requirements for a digital video library to be "browsable" are a precisely indexed content and informative abstracts. Nowadays such solutions are not common in video search engines or generic digital video platforms, therefore, the authors suggest developing some computer applications resolving the problems of at least abstracts' creation. The abstracts cannot be constructed without a deep video content analysis, including some low level processing like a shot detection towards a video sequence segmented to a series of "camera takes". The presented method, aimed at a shot detection, deploys a concept of a Motion Factor (of frame transitions). The basic definition considers the motion factor as a very sudden peak of difference between two successive frames. In some specific areas, the intrashot motion factor may suppress the shot-boundary motion factor. In order to avoid misrecognition of both motion factors during a shot detection process a concept of a differential motion factor was implemented. The full-resolution algorithm achieves the accuracy of up to 80%, however, it is very time-consuming. The shot detection accuracy was measured including true and false shots detected as well as real shots that were bounded visually. The authors' research of a representative number of movies (from various categories) has revealed that the shot detection process can be accelerated up to 500 times without any significant deterioration of shot recognition accuracy. The shot detection algorithm was accelerated in a simple manner by two-dimensional reduction of a frame resolution (in pixels).
机译:数字视频库的两个基本要求是“可浏览”是一个精确索引的内容和信息摘要。如今,这种解决方案在视频搜索引擎或通用数字视频平台中不常见,因此,作者建议开发一些计算机应用程序解决至少摘要创建的问题。在没有深度视频内容分析的情况下无法构建摘要,包括一些低级处理,如朝向视频序列的镜头检测,分段为一系列“相机”。针对镜头检测的呈现方法部署了运动因子(帧转换)的概念。基本定义认为运动因子是两个连续帧之间的突然差异的突然峰值。在一些特定区域中,intrashot运动因子可以抑制射击边界运动因子。为了避免在拍摄检测过程中避免两个运动因素的误导,实现了差分运动因子的概念。全分辨率算法达到高达80%的准确性,但是,它非常耗时。测量镜头检测精度,包括检测到的真实射击以及视觉上界定的真实镜头。作者对代表性的电影(来自各种类别)的研究透露,镜头检测过程可以加速高达500倍,而不会出现射击识别精度的任何显着恶化。通过帧分辨率的二维减少(以像素为单位),以简单的方式加速镜头检测算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号