首页> 外文会议>Conference on Innovations in Information Technologies >Investigating Automatic Recognition of Non-Native Arabic Speech
【24h】

Investigating Automatic Recognition of Non-Native Arabic Speech

机译:调查自然阿拉伯语演讲的自动识别

获取原文

摘要

Pronunciation variability is by far the most critical issue for Arabic Automatic Speech Recognition (AASR). The problem is further complicated when AASR needs to deal with both native and non-native accents. In this paper, we are concerned with the problem of non-native speech in a speaker independent, large-vocabulary speech recognition system for Modern Standard Arabic (MSA). We analyze some major differences related to the phonetic confusion in order to determine which phonemes have a significant part in the recognition performance for both native and non-native speakers. The WestPoint Language Data Consortium (LDC) modern standard Arabic database and the Hidden Markov Model Toolkit (HTK) are used in this research effort. We analyzed the performance of AASR at phonetic and word levels and we found that the introduction of the language model masks the pronunciation problems of non-native speakers.
机译:发音变异性是迄今为止阿拉伯自动语音识别(AASR)的最关键问题。当AASR需要处理原生和非本地性口音时,问题进一步复杂化。在本文中,我们涉及用于现代标准阿拉伯语(MSA)的扬声器独立,大词汇语音识别系统中的非原生语音问题。我们分析与语音混淆相关的一些主要差异,以确定哪些音素在本机和非母语人员的认可性能方面具有重要部分。 WestPoint语言数据联盟(LDC)现代标准阿拉伯语数据库和隐藏的Markov Model Toolkit(HTK)用于本研究工作。我们分析了奥斯尔在语音和词语水平时的表现,我们发现语言模型的引入掩盖了非母语扬声器的发音问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号