Video Scene Retrieval with Symbol Sequence Based on Integrated Audio and Visual Features

机译：基于集成视听特征的带符号序列的视频场景检索

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a method to retrieve semantically similar scenes to a query video from large scale video databases at high speed. Our method uses the audio features and the color histogram as the visual feature because the audio signal is closely related with the semantic content of videos and the color is an extensively used feature for content-based image retrieval systems. The feature vectors are extracted from video segments called packets and clustered in the feature vector space and transformed into symbols that represent the cluster IDs. Consequently, a video is expressed as a symbol sequence based on audio and visual features. Quick retrieval of similar scenes can be realized by symbol sequence matching. We conduct some experiments using audio, visual, and both features, and examine the effect of each feature on videos of various genres.

机译：在本文中，我们提出了一种从大型视频数据库中高速检索与查询视频语义相似的场景的方法。我们的方法使用音频特征和颜色直方图作为视觉特征，因为音频信号与视频的语义内容密切相关，并且颜色是基于内容的图像检索系统中广泛使用的特征。从称为数据包的视频段中提取特征向量，并在特征向量空间中进行聚类，然后将其转换为表示聚类ID的符号。因此，视频基于音频和视觉特征被表示为符号序列。通过符号序列匹配可以实现对相似场景的快速检索。我们使用音频，视频和这两种功能进行了一些实验，并研究了每种功能对各种类型视频的影响。

著录项

来源
《Multimedia Content Analysis, Management, and Retrieval 2006》|2006年|P.607307.1-607307.10|共10页
会议地点 San Jose CA(US)
作者
Keisuke Morisawa; Naoko Nitta; Noboru Babaguchi;
展开▼
作者单位

Graduate School of Engineering, Osaka University 2-1 Yamadaoka Suita, 565-0871, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类多媒体技术与多媒体计算机;图像信号处理;
关键词
video scene retrieval; multi-modal analysis; symbol sequence matching;

机译：视频场景检索；多模态分析；符号序列匹配;

相似文献

外文文献
中文文献
专利

1. Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features [J] . Sidiropoulos P., Mezaris V., Kompatsiaris I., Circuits and Systems for Video Technology, IEEE Transactions on . 2011,第8期

机译：使用高级视听功能对场景进行时间视频分割
2. Quick audio retrieval based on histogram feature sequences [J] . Kunio Kashino, Gavin Smith, Hiroshi Murase Acoustical science and technology . 2001,第4期

机译：基于直方图特征序列的快速音频检索
3. Quick audio retrieval based on histogram feature sequences [J] . Gavin Smith, Hiroshi Murase, Kunio Kashino Acoustical science and technology . 2000,第4期

机译：基于直方图特征序列的快速音频检索
4. Video scene retrieval with symbol sequence based on integrated audio and visual features [C] . Keisuke Morisawa, Naoko Nitta, Noboru Babaguchi Conference on Multimedia Content Analysis, Management, and Retrieval . 2006

机译：视频场景根据集成音频和可视特征检索符号序列
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. Malicious UAV Detection Using Integrated Audio and Visual Features for Public Safety Applications [O] . Sonain Jamil, Fawad, MuhibUr Rahman, 2020

机译：使用集成音频和视觉功能的公共安全应用程序进行恶意无人机检测
7. End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features [O] . Chiori Hori, Huda Alamri, Jue Wang, 2019

机译：使用基于多模式关注的视频功能的端到端音频视觉场景感知对话框

Video Scene Retrieval with Symbol Sequence Based on Integrated Audio and Visual Features

摘要

著录项

相似文献

相关主题

期刊订阅