首页> 外文会议>International conference on Asian language processing >Mongolian prosodic phrase prediction using suffix segmentation
【24h】

Mongolian prosodic phrase prediction using suffix segmentation

机译:使用后缀分割的蒙古韵律短语预测

获取原文

摘要

Accurate prosodic phrase prediction can improve the naturalness of speech synthesis. Predicting the prosodic phrase can be regarded as a sequence labeling problem and the Conditional Random Field (CRF) is typically used to solve it. Mongolian is an agglutinative language, in which massive words can be formed by concatenating these stems and suffixes. This character makes it difficult to build a Mongolian prosodic phrase predictions system, based on CRF, that has high performance. We introduce a new method that segments Mongolian word into stem and suffix as individual token. The proposed method integrates multiple features according to the characteristics of Mongolian word formation. We conduct the contrast experiment by selecting the following features: word, multi-level Part-of-Speech (POS), multi-level lexical for suffix and the existence for suffix. The experimental results show that our method has significantly enhanced the performance of the Mongolian prosodic phrase prediction system through comparing with the conventional method that treats Mongolian word as token directly. The word feature, level one lexical for suffix feature and existence for suffix feature are effective. The best result is measured by Fl-measure as 82.49%.
机译:准确的韵律短语预测可以提高语音合成的自然性。预测韵律短语可以看作是序列标记问题,通常使用条件随机场(CRF)来解决。蒙古语是一种凝集性语言,通过将这些词干和后缀串联在一起可以形成大量单词。此字符使基于CRF构建具有高性能的蒙古韵律短语预测系统变得困难。我们引入了一种新方法,将蒙古语单词分为词干和后缀作为单独的标记。该方法根据蒙古语的构词特点综合了多种特征。我们通过选择以下特征来进行对比实验:单词,多级词性(POS),后缀的多层词法和后缀的存在。实验结果表明,与直接将蒙古语词作为记号的传统方法相比,本方法大大提高了蒙古韵律短语预测系统的性能。词特征,后缀特征的一级词汇和后缀特征的存在是有效的。最好的结果通过F1测量测得为82.49%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号