首页> 外文OA文献 >Feature extraction using non-linear transformation for robust speech recognition on the Aurora database
【2h】

Feature extraction using non-linear transformation for robust speech recognition on the Aurora database

机译:使用非线性变换进行特征提取以在Aurora数据库上实现健壮的语音识别

摘要

We evaluate the performance of several feature sets on the Aurora task as defined by ETSI. We show that after a non-linear transformation, a number of features can be effectively used in a HMM-based recognition system. The non-linear transformation is computed using a neural network which is discriminatively trained on the phonetically labeled (forcibly aligned) training data. A combination of the non-linearly transformed PLP (perceptive linear predictive coefficients), MSG (modulation filtered spectrogram) and TRAP (temporal pattern) features yields a 63% improvement in error rate as compared to baseline me frequency cepstral coefficients features. The use of the non-linearly transformed RASTA-like features, with system parameters scaled down to take into account the ETSI imposed memory and latency constraints, still yields a 40% improvement in error rate.
机译:我们评估了ETSI定义的Aurora任务上几个功能集的性能。我们表明,经过非线性变换后,可以在基于HMM的识别系统中有效使用许多功能。非线性变换是使用神经网络计算的,该神经网络根据语音标记(强制对齐)的训练数据进行有区别的训练。与基准频率倒频谱系数特征相比,非线性变换的PLP(感知线性预测系数),MSG(调制滤波频谱图)和TRAP(时间模式)特征的组合可将错误率提高63%。使用非线性变换的类似RASTA的功能,并按比例缩小系统参数以考虑到ETSI施加的内存和延迟限制,仍然可以使错误率提高40%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号