首页> 外文期刊>Procedia Computer Science >Applying Linguistic G2P Knowledge on a Statistical Grapheme-to-phoneme Conversion in Khmer
【24h】

Applying Linguistic G2P Knowledge on a Statistical Grapheme-to-phoneme Conversion in Khmer

机译:语言G2P知识在高棉语中的统计音素到音素转换中的应用

获取原文
       

摘要

A Grapheme-to-Phoneme (G2P) convertor generates the pronunciation given a word. G2P is an important module in a speech synthesis system and an automatic speech recognition system. Two main G2P approaches are: knowledge-based and data-driven. The knowledge based G2P is built based on linguist knowledge. The data-driven approach such as the statistical approach on the other hand does not need expert knowledge, but it requires data to learn the rules. In this research, we propose an approach that combines linguistic knowledge into a statistical-based G2P convertor for Khmer. We examined a simple way of adding linguistic knowledge into the statistical G2P convertor by simply inserting vowel tags into a Khmer word. Three types of vowel tags were used. The main strength of this approach is it combines the strength of linguistic knowledge and statistical-based approach, to build a robust G2P model. The information allows better modeling and prediction of the phoneme sequence, thus improving the phoneme error rate (PER) and word error rate (WER). The PER and WER of our proposed Khmer G2P improve from 23.2% and 69.6% to 11.1% and 51.4% respectively.
机译:音素到音素(G2P)转换器生成给定单词的发音。 G2P是语音合成系统和自动语音识别系统中的重要模块。 G2P的两种主要方法是:基于知识的方法和基于数据的方法。基于知识的G2P是基于语言知识构建的。另一方面,诸如统计方法之类的数据驱动方法不需要专家知识,但是需要数据来学习规则。在这项研究中,我们提出了一种将语言知识结合到高棉语的基于统计的G2P转换器中的方法。我们研究了一种简单的将语言知识添加到统计G2P转换器中的方法,只需在高棉语单词中插入元音标签即可。使用了三种类型的元音标签。这种方法的主要优点是它将语言知识和基于统计的方法相结合,以构建健壮的G2P模型。该信息可以更好地建模和预测音素序列,从而提高音素错误率(PER)和字错误率(WER)。我们提议的高棉G2P的PER和WER分别从23.2%和69.6%提高到11.1%和51.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号