Conference: Annual conference of the International Speech Communication Association

CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition

Abstract

Most available colloquial Arabic speech resources are transcribed without diacritics. Diacritics carry short vowels and other pronunciation information, so omitting them introduces considerable ambiguity. In this paper, we propose using an automatic diacritisation method as a front end for training automatic speech recognition systems for colloquial Arabic. The system is based on conditional random fields trained on speaker and contextual information. This method outperforms other reported methods for diacritising colloquial Arabic by 13.2% relative. Empirical experiments show that applying this method to acoustic model training transcriptions improves recognition performance on Levantine colloquial Arabic by 1.8% relative.
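
The abstract does not list the feature set, so the following is only a minimal sketch of what "speaker and contextual information" could look like as CRF input. It assumes character-level labelling, where each undiacritised character gets a feature dictionary built from its neighbours and a speaker tag. The function names, the window size, and the `spk01` identifier are hypothetical and are not taken from the paper.

```python
def char_features(word, i, speaker_id, window=2):
    """Feature dict for the character at index i of an undiacritised word.

    Assumption: a symmetric character window plus a speaker tag; the
    paper's real features may differ.
    """
    feats = {
        "char": word[i],
        "speaker": speaker_id,
        "is_first": i == 0,
        "is_last": i == len(word) - 1,
    }
    # Contextual information: neighbouring characters, padded at word edges.
    for off in range(-window, window + 1):
        if off == 0:
            continue
        j = i + off
        feats[f"char[{off:+d}]"] = word[j] if 0 <= j < len(word) else "<pad>"
    return feats


def word_features(word, speaker_id, window=2):
    """One feature dict per character, in the order the characters are stored."""
    return [char_features(word, i, speaker_id, window) for i in range(len(word))]


if __name__ == "__main__":
    # Three-letter undiacritised word (k-t-b) spoken by a hypothetical speaker.
    for feats in word_features("\u0643\u062a\u0628", "spk01"):
        print(feats)
```

A sequence of such dictionaries, paired with one diacritic label per character, is the usual input shape for linear-chain CRF toolkits. The toolkit and label inventory are left open here because the abstract does not name them.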