首页> 外文会议>IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 >HMM-based separation of acoustic transfer function for single-channel sound source localization
【24h】

HMM-based separation of acoustic transfer function for single-channel sound source localization

机译:基于HMM的声学传递函数分离,用于单通道声源定位

获取原文

摘要

This paper presents a sound source (talker) localization method using only a single microphone, where a HMM (Hidden Markov Model) of clean speech is introduced to estimate the acoustic transfer function from a user's position. The new method is able to carry out this estimation without measuring impulse responses. The frame sequence of the acoustic transfer function is estimated by maximizing the likelihood of training data uttered from a given position, where the cepstral parameters are used to effectively represent useful clean speech. Using the estimated frame sequence data, the GMM (Gaussian Mixture Model) of the acoustic transfer function is created to deal with the influence of a room impulse response. Then, for each test data set, we find a maximum-likelihood GMM from among the estimated GMMs corresponding to each position. The effectiveness of this method has been confirmed by talker localization experiments performed in a room environment.
机译:本文提出了一种仅使用单个麦克风的声源(讲话者)定位方法,其中引入了干净语音的HMM(隐马尔可夫模型)以从用户位置估计声学传递函数。新方法能够执行此估计,而无需测量脉冲响应。通过最大化从给定位置发出的训练数据的可能性来估计声学传递函数的帧序列,在此位置,倒谱参数用于有效表示有用的清晰语音。使用估计的帧序列数据,创建声学传递函数的GMM(高斯混合模型)以处理房间脉冲响应的影响。然后,对于每个测试数据集,我们从与每个位置对应的估计GMM中找到最大似然GMM。该方法的有效性已经在室内环境中进行的讲话者定位实验得到证实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号