首页> 外文会议>Annual conference of the International Speech Communication Association >A Weighted Combination of Speech with Text-based Models for Arabic Diacritization
【24h】

A Weighted Combination of Speech with Text-based Models for Arabic Diacritization

机译:语音与基于文本的模型的加权组合,用于阿拉伯二字化

获取原文

摘要

The majority of studies on Arabic diacritization have employed textually inferred features alone. This paper proposes a novel approach, where the weighted combination of speech with a text-based model is used to allow linguistically-insensitive acoustic information to correct and complement the errors generated by the text model's diacritic predictions. The acoustic model is based on Hidden Markov Models and the textual model on Conditional Random Fields. The combination brings significant reduction in error rates across all metrics, especially in case endings, which are the most difficult to predict. It gives results superior to those of conventional methods, with diacritic and word error rates of 1.5 and 4.9 inclusive of case endings, and 1.0 and 2.7 exclusive of them.
机译:大多数关于阿拉伯语变音的研究仅采用了文字推断功能。本文提出了一种新颖的方法,其中语音与基于文本的模型的加权组合用于允许对语言不敏感的声学信息来纠正和补充由文本模型的变音符号预测所产生的错误。声学模型基于隐马尔可夫模型和条件随机场的文本模型。这种组合可显着降低所有指标的错误率,尤其是在最难以预测的情况下。它提供的结果优于传统方法,变音符号和单词错误率分别为1.5和4.9(包括大小写结尾),而1.0和2.7除外。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号