首页> 外文期刊>IEICE Transactions on Information and Systems >Distant Speech Recognition Using a Microphone Array Network
【24h】

Distant Speech Recognition Using a Microphone Array Network

机译:使用麦克风阵列网络进行远程语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

In this work, spatial information consisting of the position and orientation angle of an acoustic source is estimated by an artificial neural network (ANN). The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beam-former, thus enhancing the output signal. On the other hand, the orientation angle is used to restrict the lexicon used in the recognition phase, assuming that the speaker faces a particular direction while speaking. To compensate the effect of the transmission channel inside a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated and shows better performance than the conventional CMN for short utterances. The performance of the proposed method is evaluated through lapanese digit/command recognition experiments.
机译:在这项工作中,由人工神经网络(ANN)估算由声源的位置和定向角组成的空间信息。扬声器在封闭空间中的估计位置用于完善延迟和求和波束形成器的估计时间延迟,从而增强输出信号。另一方面,假设说话者说话时面对特定方向,则定向角用于限制识别阶段中使用的词典。为了补偿短帧分析窗口内传输通道的影响,研究了一种基于高斯混合模型(GMM)的新倒谱均值归一化(CMN)方法,该方法在短话语方面表现出比常规CMN更好的性能。通过拉丁美洲数字/命令识别实验评估了该方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号