Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation

Tseng V.S.; Ja-Hwung Su; Jhih-Hong Huang; Chih-Jen Chen

首页> 外文期刊>IEEE transactions on multimedia >Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation

【24h】

Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation

机译：视觉特征，语音特征和语义视频注释的频繁模式的集成挖掘

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

To support effective multimedia information retrieval, video annotation has become an important topic in video content analysis. Existing video annotation methods put the focus on either the analysis of low-level features or simple semantic concepts, and they cannot reduce the gap between low-level features and high-level concepts. In this paper, we propose an innovative method for semantic video annotation through integrated mining of visual features, speech features, and frequent semantic patterns existing in the video. The proposed method mainly consists of two main phases: 1) Construction of four kinds of predictive annotation models, namely speech-association, visual-association, visual-sequential, and statistical models from annotated videos. 2) Fusion of these models for annotating un-annotated videos automatically. The main advantage of the proposed method lies in that all visual features, speech features, and semantic patterns are considered simultaneously. Moreover, the utilization of high-level rules can effectively complement the insufficiency of statistics-based methods in dealing with complex and broad keyword identification in video annotation. Through empirical evaluation on NIST TRECVID video datasets, the proposed approach is shown to enhance the performance of annotation substantially in terms of precision, recall, and F-measure.

机译：为了支持有效的多媒体信息检索，视频注释已成为视频内容分析中的重要主题。现有的视频注释方法将重点放在分析低级特征或简单语义概念上，并且它们不能缩小低级特征与高级概念之间的差距。在本文中，我们提出了一种通过对视频中存在的视觉特征，语音特征和常见语义模式进行综合挖掘来进行语义视频注释的创新方法。所提出的方法主要包括两个主要阶段：1）构造四种预测注释模型，即语音关联，视觉关联，视觉顺序和来自注释视频的统计模型。 2）这些模型的融合，可以自动注释未注释的视频。所提出的方法的主要优点在于，同时考虑了所有视觉特征，语音特征和语义模式。此外，利用高级规则可以有效地弥补基于统计的方法在处理视频注释中复杂而广泛的关键字识别方面的不足。通过对NIST TRECVID视频数据集的经验评估，所提出的方法在准确性，查全率和F度量方面显示出显着提高注释的性能。

著录项

来源
《IEEE transactions on multimedia》 |2008年第2期|p.260-267|共8页
作者
Tseng V.S.; Ja-Hwung Su; Jhih-Hong Huang; Chih-Jen Chen;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
data mining; feature extraction; information retrieval; multimedia communication; speech processing; statistical analysis; video signal processing; frequent semantic pattern; integrated mining; keyword identification; multimedia information retrieval; speech-associ;

机译：数据挖掘;特征提取;信息检索;多媒体通信;语音处理;统计分析;视频信号处理;频繁语义模式;综合挖掘;关键词识别;多媒体信息检索;语音关联;

相似文献

外文文献
中文文献
专利

1. Bimodal fusion of low-level visual features and high-level semantic features for near-duplicate video clip detection [J] . Hyun-seok Min, Jae Young Choi, Wesley De Neve, Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing . 2011,第10期

机译：低级视觉特征和高级语义特征的双峰融合，用于近乎重复的视频剪辑检测
2. Integrating global and local visual features with semantic hierarchies for two-level image annotation [J] . Qian Zhiming, Zhong Ping, Chen Jia Neurocomputing . 2016,第JANa1期

机译：将全局和局部视觉功能与语义层次结构集成在一起以进行两级图像注释
3. Modeling continuous visual features for semantic image annotation and retrieval [J] . Zhixin Li, Zhiping Shi, Xi Liu, Pattern recognition letters . 2011,第3期

机译：为语义图像注释和检索建模连续的视觉特征
4. Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features [C] . Vincent S. Tseng, Ja-Hwung Su, Jhih-Hong Huang, Advances in Knowledge Discovery and Data Mining . 2008

机译：从视觉和语音特征中挖掘关联模式的语义视频注释
5. Combining Visual Features and Contextual Information for Image Retrieval and Annotation. [D] . Zhang, Rui. 2011

机译：结合视觉特征和上下文信息进行图像检索和注释。
6. Genetic Programming and Frequent Itemset Mining to Identify Feature Selection Patterns of iEEG and fMRI Epilepsy Data [O] . Otis Smart, Lauren Burrell -1

机译：遗传程序设计和频繁项集挖掘以识别iEEG和fMRI癫痫数据的特征选择模式
7. Genetic programming and frequent itemset mining to identify feature selection patterns of iEEG and fMRI epilepsy data [O] . Otis Smart, Lauren Burrell 2015

机译：遗传编程和频繁的项目集挖掘，以确定IEEG和FMRI癫痫数据的特征选择模式
8. Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments [R] . Gimpel, K., Schneider, N., O'Connor, B., 2010

机译：Twitter的词性标注：注释，功能和实验

Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅