A generic audio classification and segmentation approach for multimedia indexing and retrieval

Kiranyaz S.; Ahmad Farooq Qureshi; Gabbouj M.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >A generic audio classification and segmentation approach for multimedia indexing and retrieval

【24h】

A generic audio classification and segmentation approach for multimedia indexing and retrieval

机译：用于多媒体索引和检索的通用音频分类和分段方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We focus the attention on the area of generic and automatic audio classification and segmentation for audio-based multimedia indexing and retrieval applications. In particular, we present a fuzzy approach toward hierarchic audio classification and global segmentation framework based on automatic audio analysis providing robust, bi-modal, efficient and parameter invariant classification over global audio segments. The input audio is split into segments, which are classified as speech, music, fuzzy or silent. The proposed method minimizes critical errors of misclassification by fuzzy region modeling, thus increasing the efficiency of both pure and fuzzy classification. The experimental results show that the critical errors are minimized and the proposed framework significantly increases the efficiency and the accuracy of audio-based retrieval especially in large multimedia databases.

机译：我们将注意力集中在基于音频的多媒体索引和检索应用程序的通用和自动音频分类和分段领域。特别是，我们提出了一种基于自动音频分析的层次化音频分类和全局分段框架的模糊方法，该方法在全局音频片段上提供了鲁棒，双峰，高效和参数不变的分类。输入音频分为多个片段，分为语音，音乐，模糊或无声。所提出的方法通过模糊区域建模使错误分类的关键错误最小化，从而提高了纯分类和模糊分类的效率。实验结果表明，关键错误被最小化，并且所提出的框架显着提高了基于音频的检索的效率和准确性，尤其是在大型多媒体数据库中。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2006年第3期|p.1062-1081|共20页
作者
Kiranyaz S.; Ahmad Farooq Qureshi; Gabbouj M.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
audio databases; database indexing; fuzzy set theory; information retrieval; multimedia databases; automatic audio segmentation; fuzzy approach; fuzzy region modeling; generic audio classification; multimedia databases; multimedia indexing; multimedia retrieval; Aut;

机译：音频数据库;数据库索引;模糊集理论;信息检索;多媒体数据库;自动音频分割;模糊方法;模糊区域建模;通用音频分类;多媒体数据库;多媒体索引;多媒体检索;自动;

相似文献

外文文献
中文文献
专利

1. Audio indexing: primary components retrieval Robust classification in audio documents [J] . Julien Pinquier, Regine Andre-Obrecht Multimedia Tools and Applications . 2006,第3期

机译：音频索引：主要组件检索音频文档中的稳健分类
2. Generic content-based audio indexing and retrieval framework [J] . S. Kiranyaz, M. Gabbouj IEE proceedings, Part K. Vision, image and signal processing . 2006,第3期

机译：基于通用内容的音频索引和检索框架
3. Generic content-based audio indexing and retrieval framework [J] . S. Kiranyaz, M. Gabbouj IEE Proceedings. Part K, Vision, image and signal processing . 2006,第3期

机译：基于通用内容的音频索引和检索框架
4. Content-Based Audio Classification and Retrieval Using Segmentation, Feature Extraction and Neural Network Approach [C] . Nilesh M. Patil, Milind U. Nemade International Conference on Computer, Communication and Computational Sciences . 2019

机译：基于内容的音频分类和使用分段，特征提取和神经网络方法检索
5. Automatic segmentation, indexing and retrieval of audiovisual data based on combined audio and visual content analysis. [D] . Zhang, Tong. 1999

机译：基于组合的视听内容分析，对视听数据进行自动分段，索引和检索。
6. A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding [O] . Kehua Guo, Shigeng Zhang 2013

机译：基于本体信息隐藏的语义医学多媒体检索方法
7. A generic audio classification and segmentation approach for multimedia indexing and retrieval [O] . Kiranyaz, S, Qureshi, AF, Gabbouj, M 2006

机译：用于多媒体索引和检索的通用音频分类和分段方法

A generic audio classification and segmentation approach for multimedia indexing and retrieval

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅