首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >MORPHEME-BASED FEATURE-RICH LANGUAGE MODELS USING DEEP NEURAL NETWORKS FOR LVCSR OF EGYPTIAN ARABIC
【24h】

MORPHEME-BASED FEATURE-RICH LANGUAGE MODELS USING DEEP NEURAL NETWORKS FOR LVCSR OF EGYPTIAN ARABIC

机译:基于语素的功能丰富的语言模型,使用深神经网络为埃及阿拉伯语的LVCSR

获取原文

摘要

Egyptian Arabic (EA) is a colloquial version of Arabic. It is a low-resource morphologically rich language that causes problems in Large Vocabulary Continuous Speech Recognition (LVCSR). Building LMs on morpheme level is considered a better choice to achieve higher lexical coverage and better LM probabilities. Another approach is to utilize information from additional features such as morphological tags. On the other hand, LMs based on Neural Networks (NNs) with a single hidden layer have shown superiority over the conventional n-gram LMs. Recently, Deep Neural Networks (DNNs) with multiple hidden layers have achieved better performance in various tasks. In this paper, we explore the use of feature-rich DNN-LMs, where the inputs to the network are a mixture of words and morphemes along with their features. Significant Word Error Rate (WER) reductions are achieved compared to the traditional word-based LMs.
机译:埃及阿拉伯语(EA)是阿拉伯语的口语版本。它是一种低资源形态丰富的语言,导致大词汇连续语音识别(LVCSR)中的问题。在语素水平上建立LMS被认为是实现更高的词汇覆盖和更好的LM概率的更好选择。另一种方法是利用来自诸如形态标签的附加特征的信息。另一方面,基于具有单个隐藏层的神经网络(NNS)的LMS显示出通过传统的N-GRAM LMS的优越性。最近,具有多个隐藏层的深度神经网络(DNN)在各种任务中取得了更好的性能。在本文中,我们探讨了使用功能丰富的DNN-LMS,其中对网络的输入是单词和语素的混合以及它们的特征。与传统的基于Word的LMS相比,实现了显着的单词错误率(WER)缩减。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号