Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers

Group delay based methods for recognition of distant talking speech


Abstract

The group delay function has conventionally been used in temporal spectral analysis and feature extraction for speech recognition. In this work we present a detailed analysis of a novel approach to spatial spectral analysis of speech using the MUSIC-Group delay spectrum. In our previous work [ICASSP 2010], we proposed the MUSIC-Group delay spectrum for direction of arrival (DOA) estimation and distant speech recognition. We discuss the advantages of the proposed method in resolving closely spaced speech sources with a minimal number of sensors. The method is also analyzed from a minimum phase perspective, as is done in temporal processing of speech. Additional analysis of real-time performance is carried out using the Pisarenko-Group delay spectrum. DOAs estimated by the proposed approach are used to train filter-and-sum beamformers. Distant speech recognition experiments in clean and reverberant conditions using the beamformed speech signal indicate reasonable improvements over correlation-based and subspace-based methods.
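
To make the idea of a spatial group delay spectrum concrete, the following is a minimal sketch and not the authors' implementation. It assumes a narrowband signal model, a uniform linear array with half-wavelength spacing, and one commonly cited form of the MUSIC-Group delay spectrum: the MUSIC magnitude spectrum multiplied by the summed squared gradient of the unwrapped phase of the noise-eigenvector projections. The abstract does not state this definition, so treat it as an assumption. The names `steering`, `simulate`, and `music_gd_spectrum`, and all parameter values, are hypothetical.

```python
import numpy as np

def steering(theta_deg, n_sensors, spacing=0.5):
    """Steering vectors of a uniform linear array (spacing in wavelengths).
    Returns an array of shape (n_sensors, n_angles)."""
    theta = np.deg2rad(np.atleast_1d(theta_deg))
    m = np.arange(n_sensors)[:, None]
    return np.exp(-2j * np.pi * spacing * m * np.sin(theta)[None, :])

def simulate(doas_deg, n_sensors=8, n_snap=500, snr_db=20.0, seed=0):
    """Narrowband snapshots X = A s + noise for uncorrelated sources.
    (Assumption: speech is wideband; this is only a toy signal model.)"""
    rng = np.random.default_rng(seed)
    A = steering(doas_deg, n_sensors)
    n_src = A.shape[1]
    s = (rng.standard_normal((n_src, n_snap))
         + 1j * rng.standard_normal((n_src, n_snap))) / np.sqrt(2)
    noise = (rng.standard_normal((n_sensors, n_snap))
             + 1j * rng.standard_normal((n_sensors, n_snap))) / np.sqrt(2)
    return A @ s + 10 ** (-snr_db / 20) * noise

def music_gd_spectrum(X, n_sources, grid_deg):
    """Return (MUSIC spectrum, assumed MUSIC-Group delay spectrum) on grid_deg."""
    n_sensors, n_snap = X.shape
    R = X @ X.conj().T / n_snap                 # sample covariance matrix
    _, eigvecs = np.linalg.eigh(R)              # eigenvalues in ascending order
    Qn = eigvecs[:, : n_sensors - n_sources]    # noise-subspace eigenvectors
    proj = Qn.conj().T @ steering(grid_deg, n_sensors)  # (n_noise, n_angles)
    p_music = 1.0 / np.sum(np.abs(proj) ** 2, axis=0)
    # Group delay: gradient of the unwrapped phase along the angle axis.
    phase = np.unwrap(np.angle(proj), axis=1)
    group_delay = np.gradient(phase, axis=1)
    p_mgd = np.sum(group_delay ** 2, axis=0) * p_music
    return p_music, p_mgd

if __name__ == "__main__":
    grid = np.arange(-90.0, 90.5, 0.5)
    X = simulate([-20.0, 30.0])
    p_music, p_mgd = music_gd_spectrum(X, n_sources=2, grid_deg=grid)
    print("MUSIC peak (deg):", grid[np.argmax(p_music)])
    print("MUSIC-GD peak (deg):", grid[np.argmax(p_mgd)])
```

In this toy setting both spectra should peak near one of the simulated directions. The resolution and sensor-count advantages discussed in the abstract concern closely spaced sources and are not demonstrated by this sketch.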
