Toward semantic indexing and retrieval using hierarchical audio models

Wei-Ta Chu; Wen-Huang Cheng; Jane Yung-Jen Hsu; Ja-Ling Wu

Abstract

Semantic-level content analysis is crucial to efficient content retrieval and management. We propose a hierarchical approach that models the statistical characteristics of audio events over a time series to accomplish semantic context detection. Two stages, audio event modeling and semantic context modeling, are devised to bridge the semantic gap between physical audio features and semantic concepts. In this work, hidden Markov models (HMMs) are used to model four representative audio events in action movies: gunshot, explosion, engine, and car-braking. At the semantic-context level, Gaussian mixture models (GMMs) and ergodic HMMs are investigated to fuse the characteristics and correlations of the various audio events. They provide cues for detecting gunplay and car-chasing scenes, the two semantic contexts we focus on in this work. The promising experimental results demonstrate the effectiveness of the proposed approach and show that the proposed framework provides a foundation for semantic indexing and retrieval. Moreover, the two fusion schemes are compared, and the relations between audio events and semantic contexts are studied.
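
The semantic-context stage described above can be illustrated with a minimal sketch. It assumes that the audio-event stage has already produced one confidence score per event for a segment, and it fuses those four scores with one diagonal-covariance GMM per semantic context. The function names, the score range, and every GMM parameter below are hypothetical values chosen for illustration; they are not taken from the paper, and the ergodic-HMM fusion scheme is not shown.

```python
import math

# Order of the event-confidence vector (the four events named in the abstract).
EVENTS = ["gunshot", "explosion", "engine", "car-braking"]


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def diag_gaussian_logpdf(x, mean, var):
    """Log density of a Gaussian with diagonal covariance."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - mu) ** 2 / v)
        for xi, mu, v in zip(x, mean, var)
    )


def gmm_loglik(x, components):
    """Log-likelihood of x under a GMM given as (weight, mean, var) tuples."""
    return log_sum_exp([
        math.log(w) + diag_gaussian_logpdf(x, mean, var)
        for w, mean, var in components
    ])


# Hypothetical GMM parameters, one mixture per semantic context.
# In practice they would be estimated from labeled training segments.
CONTEXT_GMMS = {
    "gunplay": [
        (0.6, [0.8, 0.3, 0.1, 0.1], [0.02, 0.05, 0.02, 0.02]),
        (0.4, [0.6, 0.7, 0.1, 0.1], [0.03, 0.03, 0.02, 0.02]),
    ],
    "car-chasing": [
        (0.7, [0.1, 0.1, 0.8, 0.5], [0.02, 0.02, 0.02, 0.05]),
        (0.3, [0.1, 0.3, 0.6, 0.8], [0.02, 0.05, 0.03, 0.02]),
    ],
}


def detect_context(event_scores):
    """Return the most likely context and the per-context log-likelihoods."""
    logliks = {
        name: gmm_loglik(event_scores, comps)
        for name, comps in CONTEXT_GMMS.items()
    }
    return max(logliks, key=logliks.get), logliks


if __name__ == "__main__":
    # A segment dominated by gunshot and explosion confidences.
    label, scores = detect_context([0.9, 0.4, 0.05, 0.1])
    print(label, scores)
```

Scoring each segment against every context model and taking the maximum is only one possible decision rule; a deployed detector would more likely threshold each context's likelihood separately, so that a segment can match neither context.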

Bibliographic record

Source: Multimedia Systems, 2005, Issue 6, pp. 570-583 (14 pages)
Authors: Wei-Ta Chu; Wen-Huang Cheng; Jane Yung-Jen Hsu; Ja-Ling Wu
Affiliation: Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei, Taiwan 106
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords: audio event; semantic context; semantic gap; hidden Markov model; Gaussian mixture model
Date added to database: 2022-08-18 02:07:14

