...
首页> 外文期刊>IEEE transactions on multimedia >Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation
【24h】

Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation

机译:视觉特征,语音特征和语义视频注释的频繁模式的集成挖掘

获取原文
获取原文并翻译 | 示例

摘要

To support effective multimedia information retrieval, video annotation has become an important topic in video content analysis. Existing video annotation methods put the focus on either the analysis of low-level features or simple semantic concepts, and they cannot reduce the gap between low-level features and high-level concepts. In this paper, we propose an innovative method for semantic video annotation through integrated mining of visual features, speech features, and frequent semantic patterns existing in the video. The proposed method mainly consists of two main phases: 1) Construction of four kinds of predictive annotation models, namely speech-association, visual-association, visual-sequential, and statistical models from annotated videos. 2) Fusion of these models for annotating un-annotated videos automatically. The main advantage of the proposed method lies in that all visual features, speech features, and semantic patterns are considered simultaneously. Moreover, the utilization of high-level rules can effectively complement the insufficiency of statistics-based methods in dealing with complex and broad keyword identification in video annotation. Through empirical evaluation on NIST TRECVID video datasets, the proposed approach is shown to enhance the performance of annotation substantially in terms of precision, recall, and F-measure.
机译:为了支持有效的多媒体信息检索,视频注释已成为视频内容分析中的重要主题。现有的视频注释方法将重点放在分析低级特征或简单语义概念上,并且它们不能缩小低级特征与高级概念之间的差距。在本文中,我们提出了一种通过对视频中存在的视觉特征,语音特征和常见语义模式进行综合挖掘来进行语义视频注释的创新方法。所提出的方法主要包括两个主要阶段:1)构造四种预测注释模型,即语音关联,视觉关联,视觉顺序和来自注释视频的统计模型。 2)这些模型的融合,可以自动注释未注释的视频。所提出的方法的主要优点在于,同时考虑了所有视觉特征,语音特征和语义模式。此外,利用高级规则可以有效地弥补基于统计的方法在处理视频注释中复杂而广泛的关键字识别方面的不足。通过对NIST TRECVID视频数据集的经验评估,所提出的方法在准确性,查全率和F度量方面显示出显着提高注释的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号