首页> 外文会议>International Conference on Image and Signal Processing >Robust Arabic Multi-stream Speech Recognition System in Noisy Environment
【24h】

Robust Arabic Multi-stream Speech Recognition System in Noisy Environment

机译:嘈杂环境中强大的阿拉伯语多流语音识别系统

获取原文

摘要

In this paper, the framework of multi-stream combination has been explored to improve the noise robustness of automatic speech recognition systems. The main important issues of multi-stream systems are which features representation to combine and what importance (weights) be given to each one. Two stream features have been investigated, namely the MFCC features and a set of complementary features which consists of pitch frequency, energy and the first three formants. Empiric optimum weights are fixed for each stream. The multi-stream vectors are modeled by Hidden Markov Models (HMMs) with Gaussian Mixture Models (GMMs) state distributions. Our ASR is implemented using HTK toolkit and ARADIGIT corpus which is data base of Arabic spoken words. The obtained results show that for highly noisy speech, the proposed multi-stream vectors leads to a significant improvement in recognition accuracy.
机译:本文已经探讨了多流组合的框架,以提高自动语音识别系统的噪声稳健性。多流系统的主要重要问题是结合组合的特征表示以及每个重点(重量)被给予。已经研究了两个流特征,即MFCC特征和由音高频率,能量和前三种塑料组成的一组互补特征。为每个流固定经验最佳重量。多流矢量通过带有高斯混合模型(GMMS)状态分布的隐马尔可夫模型(HMMS)进行建模。我们的ASR是使用HTK Toolkit和Aradigit语料库来实现的,该语料库是阿拉伯语口语单词的数据库。获得的结果表明,对于高度嘈杂的言论,所提出的多流矢量导致识别准确性的显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号