首页> 外文会议>International Conference on Information Technology Systems and Innovation >On real time Q-log-based feature normalization for distant speech recognition
【24h】

On real time Q-log-based feature normalization for distant speech recognition

机译:基于Q-log的基于Q-log的特征归一化,用于遥远语音识别

获取原文

摘要

The computation of long term mean in feature normalization methods requires information on future frames, and thus makes them inapplicable for real-time implementations. Previously, q-log spectral mean normalization (q-LSMN) as feature normalization method is proposed, and it shows more effective result than conventional normalization methods. However, q-LSMN has not yet been implemented on real time. In this paper, we propose a real time implementation of q-LSMN. In this method, the mean is updated recursively based on only previous frames, hence no future frame information is needed. Experiments on Aurora-5 databases showed that while real time q-LSMN achieved slightly worse performance than non real time q-LSMN as expected, it improved the recognition accuracy up to 54.22% compared to that of non-real time conventional normalization methods such as cepstral mean normalization (CMN) and Log spectral mean normalization (LSMN).
机译:特征归一化方法中长期平均值的计算需要关于未来帧的信息,从而使它们不适用于实时实现。以前,提出了Q-Log光谱均衡(Q-LSMN)作为特征归一化方法,并且它显示比传统的归一化方法更有效的结果。但是,Q-LSMN尚未实时实施。在本文中,我们建议实时实现Q-LSMN。在该方法中,仅基于先前的帧递归地更新平均值,因此不需要未来的帧信息。 Aurora-5数据库的实验表明,虽然Q-LSMN的实时比非实时Q-LSMN实现略差差,但它与非实时常规规范化方法相比,识别精度高达54.22%抗康斯兰语意味着归一化(CMN)和对数光谱均衡(LSMN)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号