首页> 外文会议>Joint Workshop on Hands-free Speech Communication and Microphone Arrays >An analysis of nonstationary variance estimates in the maximum negentropy beamformer
【24h】

An analysis of nonstationary variance estimates in the maximum negentropy beamformer

机译:最大共阴波束形成器中的非标失方差估计分析

获取原文

摘要

This work extends a beamforming algorithm intended for automatic recognition of speech data captured with an array of distant microphones. In addition to enforcing a distortionless constraint in a desired direction, the algorithm adjusts the sensor weights so as to maximize a negentropy criterion. Negentropy is a measure of how non-Gaussian the probability density function (pdf) of a random variable is, and thus its computation depends on a number of pdf parameters. Here time-dependent pdf parameters are introduced to account for the nonstationarity of speech. Several methods are evaluated in a set of far-field ASR experiments. It is found that phone-length windows for the estimation lead to an increase of word error rate, and an analysis is provided that clarifies the reason for this behavior. Most importantly, we provide evidence that negentropy may not be an ideal cost criterion, not only when using phone-dependent parameters, but also in the original system.
机译:该工作扩展了一种用于自动识别捕获的遥控麦克风阵列的语音数据的波束形成算法。 除了在期望的方向上实施无失真约束之外,该算法还调整传感器权重,以最大化进入的降级标准。 上部是一种衡量随机变量的概率密度函数(PDF)的衡量标准,因此其计算取决于许多PDF参数。 这里引入了时间相关的PDF参数,以解释语音的非间抗性。 在一组远场ASR实验中评估了几种方法。 有发现,用于估计的电话长度窗口导致单词错误率的增加,并提供了分析,以阐明这种行为的原因。 最重要的是,我们提供了证据表明,不仅在使用电话依赖的参数时,而且在原始系统中也不只有在理想的成本标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号