首页> 外文会议> >Inequality Maximum Entropy Classifier with Character Features for Polyphone Disambiguation in Mandarin TTS Systems
【24h】

Inequality Maximum Entropy Classifier with Character Features for Polyphone Disambiguation in Mandarin TTS Systems

机译:具有字符特征的不等式最大熵分类器,用于普通话TTS系统中的语音歧义消除

获取原文

摘要

Grapheme-to-phoneme (G2P) conversion is an important component in TTS systems. The difficulty in Chinese G2P conversion is to disambiguate the polyphones. In this paper, we formulate the polyphone disambiguation problem into a classification problem and propose a language independent classifier based on maximum entropy to address the issue. Furthermore, we introduce inequality smoothing to alleviate data sparseness and exploit language independent character features as linguistic knowledge. Experimental results show that the character features perform as well as the language dependent features such as words and part-of-speech, compared with the widely-used Gaussian smoothing, the inequality smoothing can greatly reduce the active features used in the classifier and achieve better performance. Our classifier achieves 96.35% in term of overall accuracy, greatly superior to 81.22% by using high-frequent "pin-yin"(Romanization of Chinese phoneme). Finally, we explore to merge all key polyphones into 6 groups and find that the overall accuracy only decreases about 2% and the active features are reduced more than 33% further
机译:音素到音素(G2P)转换是TTS系统中的重要组成部分。中文G2P转换的困难在于消除复音语音的歧义。在本文中,我们将多音素消歧问题公式化为分类问题,并提出了基于最大熵的独立于语言的分类器以解决该问题。此外,我们引入不等式平滑以减轻数据稀疏性,并利用独立于语言的字符特征作为语言知识。实验结果表明,与广泛使用的高斯平滑相比,字符特征的表现与语言相关的特征(如单词和词性)表现得更好,与不依赖语言的特征相比,不等式平滑可以大大减少分类器中使用的有效特征并获得更好的效果。表现。我们的分类器的整体准确率达到96.35%,通过使用频繁的“拼音”(汉语音素的罗马化)大大超过了81.22%。最后,我们探索将所有关键的复音电话合并为6组,发现整体准确度仅下降了约2%,有效功能进一步下降了33%以上

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号