【24h】

An Inhibition/Enhancement network for noise robust ASR

机译:抑制/增强网络,用于增强噪声的ASR

获取原文

摘要

This paper describes an evaluation of Inhibition/Enhancement (In/En) network for noise robust automatic speech recognition (ASR). In articulatory feature based speech recognition using neural network, the In/En network is needed to discriminate whether the articulatory features (AFs) dynamic patterns of trajectories are convex or concave. The network is used to achieve categorical AFs movement by enhancing AFs peak patterns (convex patterns) and inhibiting AFs dip patterns (concave patterns). We have analyzed the effectiveness of the In/En algorithm by incorporating it into a system which consists of three stages: a) Multilayer Neural Networks (MLNs), b) an In/En Network and c) the Gram-Schmidt (GS) algorithm for orthogonalization. From the experiments using Japanese Newspaper Article Sentences (JNAS) database in clean and noisy acoustic environments, it is observed that the In/En network plays a significant role on the improvement of phoneme recognition performance. Moreover, the In/En network reduces the number of mixture components needed in Hidden Markov Models (HMMs).
机译:本文介绍了对抑制/增强(In / En)网络进行的鲁棒自动语音识别(ASR)评估。在使用神经网络的基于发音特征的语音识别中,需要In / En网络来区分轨迹的发音特征(AFs)动态模式是凸的还是凹的。该网络用于通过增强AF的峰值模式(凸型)和抑制AF的倾斜模式(凹型)来实现AF的分类运动。我们通过将In / En算法合并到一个包含三个阶段的系统中来分析In / En算法的有效性:a)多层神经网络(MLN),b)In / En网络和c)Gram-Schmidt(GS)算法用于正交化。从在干净和嘈杂的声学环境中使用“日本报纸文章句子”(JNAS)数据库进行的实验中,可以看到In / En网络在提高音素识别性能方面起着重要作用。此外,In / En网络减少了隐马尔可夫模型(HMM)中所需的混合组分的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号