A Flexible and Scalable Audio Information Retrieval System for Mixed-Type Audio Signals

Abstract

The content-based classification and retrieval of real-world audio clips is one of the challenging tasks in multimedia information retrieval. Although the problem has been well studied over the last two decades, most current retrieval systems cannot provide flexible querying of audio clips because real-world audio information is often of mixed type (e.g., speech over music and speech over environmental sound). We present here a complete, scalable, and extensible content-based classification and retrieval system for mixed-type audio clips. The system lets users query audio data semantically in four alternative ways: querying by mixed-type audio classes, querying by domain-based fuzzy classes, querying by temporal information and temporal relationships, and querying by example (QBE). To reduce retrieval time, a hash-based indexing technique is introduced. Two kinds of experiments were conducted on the audio tracks of the TRECVID news broadcasts to evaluate the performance of the proposed system. The results of our experiments demonstrate that the Audio Spectrum Flatness feature in the MPEG-7 standard performs better on music audio samples than on other kinds of audio samples and that the system is robust under different conditions.
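
The abstract names Audio Spectrum Flatness without defining it. As a rough illustration only, the sketch below computes plain spectral flatness (the geometric mean of the power spectrum divided by its arithmetic mean) over a whole frame. It is not the authors' implementation, and it simplifies the MPEG-7 descriptor, which is computed per frequency band. The function names `power_spectrum` and `spectral_flatness`, the frame length, and the test signals are assumptions made for this example.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Naive O(n^2) DFT power spectrum of a real frame (bins 1 .. n/2 - 1, DC skipped)."""
    n = len(frame)
    spectrum = []
    for k in range(1, n // 2):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_flatness(power, eps=1e-12):
    """Geometric mean over arithmetic mean of the power coefficients.

    Values near 0 indicate a peaky (tonal) spectrum; values closer to 1
    indicate a flat (noise-like) spectrum. eps guards against log(0).
    """
    log_mean = sum(math.log(p + eps) for p in power) / len(power)
    arith_mean = sum(power) / len(power) + eps
    return math.exp(log_mean) / arith_mean


# Hypothetical test signals: a pure tone and seeded Gaussian noise.
n = 256
tone = [math.sin(2 * math.pi * 8 * t / n) for t in range(n)]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(n)]

tone_flatness = spectral_flatness(power_spectrum(tone))
noise_flatness = spectral_flatness(power_spectrum(noise))
print(f"tone flatness:  {tone_flatness:.3e}")
print(f"noise flatness: {noise_flatness:.3f}")
```

The pure tone concentrates its energy in one bin and scores close to zero, while the noise frame scores much higher. This is the kind of contrast a flatness feature can exploit when separating tonal material such as music from noise-like sounds.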

Bibliographic details

  • Source
    International Journal of Intelligent Systems | 2011, Issue 10 | pp. 952-970 | 19 pages
  • Author affiliations

    Communications Division, ASELSAN Electronics Industries Inc., Ankara, Turkey;

    Department of Computer Engineering, Başkent University, Ankara, Turkey;

    Department of Computer Engineering, Middle East Technical University, Ankara, Turkey;

  • Original format: PDF
  • Language: English
