首页> 外文会议>2011 IEEE International Conference on Acoustics, Speech and Signal Processing >Rapid phonetic transcription using everyday life natural Chat Alphabet orthography for dialectal Arabic speech recognition
【24h】

Rapid phonetic transcription using everyday life natural Chat Alphabet orthography for dialectal Arabic speech recognition

机译:使用日常生活自然的聊天字母拼写法进行快速语音转录,以实现方言阿拉伯语音识别

获取原文

摘要

We propose the Arabic Chat Alphabet (ACA) as naturally written in everyday life for dialectal Arabic speech transcription. Our assumption is that ACA is a natural language that includes short vowels that are missing in traditional Arabic orthography. Furthermore, ACA transcriptions can be rapidly prepared. Egyptian Colloquial Arabic was chosen as a typical dialect. Two speech recognition baselines were built: phonemic and graphemic. Original transcriptions were re-written in ACA by different transcribers. Ambiguous ACA sequences were handled by automatically generating all possible variants. ACA variations across transcribers were modeled by phonemes normalization and merging. Results show that the ACA-based approach outperforms the graphemic baseline while it performs as accurate as the phoneme-based baseline with a slight increase in WER.
机译:我们提议在日常生活中自然编写的阿拉伯语聊天字母(ACA)用于方言阿拉伯语语音转录。我们的假设是,ACA是一种自然语言,其中包含传统阿拉伯语正字法中缺少的短元音。此外,可以快速准备ACA转录。埃及口语阿拉伯语被选为典型的方言。建立了两个语音识别基线:音素和音素。原始抄本由不同的抄写员在ACA中重写。通过自动生成所有可能的变体来处理模棱两可的ACA序列。通过音素归一化和合并来模拟跨转录者的ACA变异。结果表明,基于ACA的方法优于基于音素的基线,而其性能与基于音素的基线一样准确,并且WER略有增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号