HMM-based separation of acoustic transfer function for single-channel sound source localization

机译：基于HMM的声学传递函数分离，用于单通道声源定位

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a sound source (talker) localization method using only a single microphone, where a HMM (Hidden Markov Model) of clean speech is introduced to estimate the acoustic transfer function from a user's position. The new method is able to carry out this estimation without measuring impulse responses. The frame sequence of the acoustic transfer function is estimated by maximizing the likelihood of training data uttered from a given position, where the cepstral parameters are used to effectively represent useful clean speech. Using the estimated frame sequence data, the GMM (Gaussian Mixture Model) of the acoustic transfer function is created to deal with the influence of a room impulse response. Then, for each test data set, we find a maximum-likelihood GMM from among the estimated GMMs corresponding to each position. The effectiveness of this method has been confirmed by talker localization experiments performed in a room environment.

机译：本文提出了一种仅使用单个麦克风的声源（讲话者）定位方法，其中引入了干净语音的HMM（隐马尔可夫模型）以从用户位置估计声学传递函数。新方法能够执行此估计，而无需测量脉冲响应。通过最大化从给定位置发出的训练数据的可能性来估计声学传递函数的帧序列，在此位置，倒谱参数用于有效表示有用的清晰语音。使用估计的帧序列数据，创建声学传递函数的GMM（高斯混合模型）以处理房间脉冲响应的影响。然后，对于每个测试数据集，我们从与每个位置对应的估计GMM中找到最大似然GMM。该方法的有效性已经在室内环境中进行的讲话者定位实验得到证实。

著录项

来源
《IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010》|2010年|p.2830-2833|共4页
会议地点 Dallas, TX(US);Dallas, TX(US)
作者
Takashima, Ryoichi; Takiguchi, Tetsuya; Ariki, Yasuo;
展开▼
作者单位

Graduate School of Engineering Kobe University 1-1 Rokkodai Nada-ku 657-8501 Japan;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
acoustic transfer function; maximum likelihood; single channel; talker localization;

机译：声传递函数；最大似然;单通道说话者本地化;

相似文献

外文文献
中文文献
专利

1. Single-channel talker localization based on separation of the acoustic transfer function using hidden Markov model and its classification [J] . Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki Acoustical science and technology . 2013,第3期

机译：基于隐马尔可夫模型的声学传递函数分离的单通道说话人定位及其分类
2. Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions [J] . Tetsuya Takiguchi, Yuji Sumida, Ryoichi Takashima, EURASIP journal on advances in signal processing . 2009,第4期

机译：基于声学传递函数判别的单通道说话人定位
3. Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions [J] . Tetsuya Takiguchi, Yuji Sumida, Ryoichi Takashima, EURASIP journal on advances in signal processing . 2009,第1期

机译：基于声学传递函数判别的单通道说话人定位
4. Feature selection based on Multiple Kernel Learning for single-channel sound source localization using the acoustic transfer function [C] . Takashima Ryoichi, Takiguchi Tetsuya, Ariki Yasuo 2011 IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：基于多核学习的特征选择，使用声学传递函数进行单通道声源定位
5. Sound, central auditory nervous system function and human gait: The effect of quiet and localized sound sources on the gait of people with normal and atypical central auditory nervous system function [D] . Hubbeling, Charles Robert 1999

机译：声音，中枢听觉神经系统功能和步态：安静且局部的声源对正常和非典型中枢听觉神经系统功能者的步态的影响
6. Single-Channel Multiple-Receiver Sound Source Localization System with Homomorphic Deconvolution and Linear Regression [O] . Yeonseok Park, Anthony Choi, Keonwook Kim 2021

机译：单通道多接收器声源定位系统具有同型折折叠和线性回归
7. Single-channel talker localization based on separation of the acoustic transfer function using hidden Markov model and its classification [O] . Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki 2013

机译：基于使用隐马尔可夫模型的声学传递函数的分离的单通道讲话者定位及其分类

HMM-based separation of acoustic transfer function for single-channel sound source localization

摘要

著录项

相似文献

相关主题

期刊订阅