首页> 外文会议>INTERSPEECH 2012 >Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients
【24h】

Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients

机译:基于交叉功率谱相位数形状的讲话者头向估计

获取原文

摘要

This paper presents a talker's head orientation estimation method using 2-channel microphones. In recent research, some approaches based on a network of microphone arrays have been proposed in order to estimate the talker's head orientation. In those methods, the talker's head orientation is estimated using the sound amplitude or peak value of CSP (Cross-power Spectrum Phase) coefficients obtained from each microphone array. However, microphone array network systems need many microphone arrays to be set along the walls of a given room so that sub-microphone arrays surround the user. In this paper, we focus on the shape of the CSP coefficients affected by the reverberation, which depends on the talker's position and the head orientation. In our proposed method, we use not only the peak value but also the other values of the CSP coefficients as feature vectors, and the talker's position and the head orientation are es timated by discriminating the CSP vector. The effectiveness of this method has been confirmed by talker localization and head orientation estimation experiments performed in a real environment.
机译:本文介绍了使用2通道麦克风的Talker的头定向估计方法。在最近的研究中,提出了一种基于麦克风阵列网络的方法,以估计Talker的头向定位。在这些方法中,使用从每个麦克风阵列获得的CSP(交叉功率频谱相位)系数的声音幅度或峰值估计Talker的头向。然而,麦克风阵列网络系统需要许多沿着给定房间的壁设定的麦克风阵列,使得子麦克风阵列环绕着用户。在本文中,我们专注于受混响影响的CSP系数的形状,这取决于讲话者的位置和头部方向。在我们所提出的方法中,我们不仅使用峰值,而且使用CSP系数的其他值作为特征向量,并且通过判断CSP向量来进行讲话者的位置和头向定时。通过在真实环境中进行的讲话者定位和头部方向估计实验证实了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号