...
首页> 外文期刊>IEEE transactions on audio, speech and language processing >A generic audio classification and segmentation approach for multimedia indexing and retrieval
【24h】

A generic audio classification and segmentation approach for multimedia indexing and retrieval

机译:用于多媒体索引和检索的通用音频分类和分段方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We focus the attention on the area of generic and automatic audio classification and segmentation for audio-based multimedia indexing and retrieval applications. In particular, we present a fuzzy approach toward hierarchic audio classification and global segmentation framework based on automatic audio analysis providing robust, bi-modal, efficient and parameter invariant classification over global audio segments. The input audio is split into segments, which are classified as speech, music, fuzzy or silent. The proposed method minimizes critical errors of misclassification by fuzzy region modeling, thus increasing the efficiency of both pure and fuzzy classification. The experimental results show that the critical errors are minimized and the proposed framework significantly increases the efficiency and the accuracy of audio-based retrieval especially in large multimedia databases.
机译:我们将注意力集中在基于音频的多媒体索引和检索应用程序的通用和自动音频分类和分段领域。特别是,我们提出了一种基于自动音频分析的层次化音频分类和全局分段框架的模糊方法,该方法在全局音频片段上提供了鲁棒,双峰,高效和参数不变的分类。输入音频分为多个片段,分为语音,音乐,模糊或无声。所提出的方法通过模糊区域建模使错误分类的关键错误最小化,从而提高了纯分类和模糊分类的效率。实验结果表明,关键错误被最小化,并且所提出的框架显着提高了基于音频的检索的效率和准确性,尤其是在大型多媒体数据库中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号