首页> 外文会议>Annual conference of the International Speech Communication Association >Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients
【24h】

Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients

机译:基于互功率谱相位系数形状判别的讲话者头部取向估计

获取原文

摘要

This paper presents a talker's head orientation estimation method using 2-channel microphones. In recent research, some approaches based on a network of microphone arrays have been proposed in order to estimate the talker's head orientation. In those methods, the talker's head orientation is estimated using the sound amplitude or peak value of CSP (Cross-power Spectrum Phase) coefficients obtained from each microphone array. However, microphone array network systems need many microphone arrays to be set along the walls of a given room so that sub-microphone arrays surround the user. In this paper, we focus on the shape of the CSP coefficients affected by the reverberation, which depends on the talker's position and the head orientation. In our proposed method, we use not only the peak value but also the other values of the CSP coefficients as feature vectors, and the talker's position and the head orientation are estimated by discriminating the CSP vector. The effectiveness of this method has been confirmed by talker localization and head orientation estimation experiments performed in a real environment.
机译:本文提出了一种使用2通道麦克风的讲话者头部取向估计方法。在最近的研究中,已经提出了一些基于麦克风阵列网络的方法,以便估计讲话者的头部方向。在那些方法中,使用从每个麦克风阵列获得的CSP(交叉功率频谱相位)系数的声音幅度或峰值来估算讲话者的头部方位。但是,麦克风阵列网络系统需要沿着给定房间的墙壁设置许多麦克风阵列,以使子麦克风阵列围绕用户。在本文中,我们关注受混响影响的CSP系数的形状,该形状取决于讲话者的位置和头部的朝向。在我们提出的方法中,我们不仅使用峰值,而且还将CSP系数的其他值用作特征向量,并且通过区分CSP向量来估计讲话者的位置和头部方向。该方法的有效性已通过在真实环境中进行的讲话者定位和头部方位估计实验得到证实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号