首页> 外文会议>International Symposium on Computer Music Multidisciplinary Research >Automatic Recognition of Sound Categories from Their Vocal Imitation Using Audio Primitives Automatically Found by SI-PLCA and HMM
【24h】

Automatic Recognition of Sound Categories from Their Vocal Imitation Using Audio Primitives Automatically Found by SI-PLCA and HMM

机译:使用SI-PLCA和HMM自动发现的音频基元自动识别声音模仿的声音类别

获取原文

摘要

In this paper we study the automatic recognition of sound categories (such as fridge, mixers or sawing sounds) from their vocal imitations. Vocal imitations are made of a succession over time of sounds produced using vocal mechanisms that can largely differ from the ones used in speech. We develop here a recognition approach inspired by automatic-speech-recognition systems, with an acoustic model (that maps the audio signal to a set of probability over "phonemes") and a language model (that represents the expected succession of "phonemes" for each sound category). Since we do not know what are the underlying "phonemes" of vocal imitations we propose to automatically estimate them using Shift-Invariant Probabilistic Latent Component Analysis (SI-PLCA) applied to a dataset of vocal imitations. The kernel distributions of the SI-PLCA are considered as the "phonemes" of vocal imitation and its impulse distributions are used to compute the emission probabilities of the states of a set of Hidden Markov Models (HMMs). To evaluate our proposal, we test it for a task of automatically recognizing 12 sound categories from their vocal imitations.
机译:在本文中,我们研究了从他们的声音模仿的自动识别声音类别(如冰箱,混频器或锯声)。声音模仿是通过使用在语音中使用的声音机制而产生的声音产生的声音随着时间的推移。我们在这里开发了一种由自动语音识别系统启发的识别方法,具有声学模型(将音频信号映射到“音素”)和语言模型(表示值的“音素”的预期连续每个声音类别)。由于我们不知道声乐模仿的潜在的“音素”是什么,我们建议使用应用于声乐模仿数据集的移位不变概率潜在分量分析(SI-PLCA)自动估计它们。 SI-PLCA的内核分布被认为是声乐模仿的“音素”,其脉冲分布用于计算一组隐马尔可夫模型(HMMS)的排放概率。为了评估我们的提议,我们测试它是一项任务,可以自动识别来自其​​声音模仿的12个声音类别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号