HMM-Based Audio Keyword Generation

Abstract

With the exponential growth in the production of multimedia data, there is an increasing need for video semantic analysis. Audio, as a significant part of video, provides important cues to human perception when people browse and try to understand video content. To detect semantic content from useful audio information, we introduce audio keywords, which are sets of specific sounds related to semantic events. In our previous work, we designed a hierarchical Support Vector Machine (SVM) classifier for audio keyword identification. A weakness of that work is that audio signals are artificially segmented into 20 ms frames for frame-based SVM identification, without any contextual information. In this paper, we propose a classification method based on the Hidden Markov Model (HMM) for audio keyword identification, as an improvement over the hierarchical SVM classifier. The choice of HMM is motivated by its success in speech recognition. Unlike frame-based SVM classification followed by majority voting, our proposed HMM-based classifiers treat a specific sound as continuous time-series data and employ hidden state transitions to capture context information. In particular, we study how to find an effective HMM, i.e., how to determine its topology, observation vectors, and statistical parameters. We also compare HMM structures with different numbers of hidden states, and adjust for time-series data of variable length. The experimental data consist of 40 minutes of basketball audio taken from real-time sports games. Experimental results show that, for audio keyword generation, the proposed HMM-based method outperforms the previous hierarchical SVM.
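The contrast between frame-wise voting and sequence modelling can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes discrete observation symbols (for example, quantized frame features) instead of the continuous observation vectors an audio system would normally use, and the keyword names, two-state left-to-right topology, and all probabilities below are hypothetical toy values. One HMM is kept per audio keyword, each candidate segment is scored with the scaled forward algorithm, and the keyword whose model gives the highest log-likelihood is chosen.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM."""
    if not obs:
        raise ValueError("observation sequence must not be empty")
    n_states = len(start)
    # Forward variables for the first frame.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the hidden-state transitions, then emit.
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states))
                * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        # Rescale to avoid underflow on long sequences; accumulate the log.
        alpha = [a / scale for a in alpha]
        log_prob += math.log(scale)
    return log_prob


def classify(obs, models):
    """Return the name of the model with the highest log-likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))


# Hypothetical toy models: (start, transition, emission) over symbols 0..2.
# Both are left-to-right: state 0 may stay or move to state 1, never back.
LEFT_TO_RIGHT = [[0.7, 0.3], [0.0, 1.0]]
MODELS = {
    # keyword_a: mostly symbol 0 first, then mostly symbol 2.
    "keyword_a": ([1.0, 0.0], LEFT_TO_RIGHT,
                  [[0.8, 0.1, 0.1], [0.1, 0.1, 0.8]]),
    # keyword_b: the same symbols in the opposite temporal order.
    "keyword_b": ([1.0, 0.0], LEFT_TO_RIGHT,
                  [[0.1, 0.1, 0.8], [0.8, 0.1, 0.1]]),
}

if __name__ == "__main__":
    # Same multiset of frames, different order: only a sequence model
    # can tell these two segments apart.
    print(classify([0, 0, 0, 2, 2], MODELS))
    print(classify([2, 2, 0, 0, 0], MODELS))
    # Segments of different length are scored by the same models.
    print(classify([0, 2, 2, 2, 2, 2, 2], MODELS))
```

The first two test segments contain exactly the same frames in a different order, so any scheme that labels frames independently and then votes sees identical evidence for both, whereas the transition structure of the HMM separates them. Because the forward recursion runs over however many frames are present, segments of variable length need no padding in this sketch.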

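Bibliographic details

Source: Pacific Rim Conference on Multimedia (PCM 2004), 2004, 9 pages
Conference location: Tokyo, Japan (30 November to 3 December 2004)
Authors: Min Xu; Ling-Yu Duan; Jianfei Cai; Liang-Tien Chia; Changsheng Xu; Qi Tian
Original format: PDF
Chinese Library Classification: Multimedia technology and multimedia computers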

