首页> 外文会议>2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering(NLP-KE'07) >A Comparative Study of Diverse Knowledge Sources and Smoothing Techniques via Maximum Entropy for Polyphone Disambiguation in Mandarin TTS Systems
【24h】

A Comparative Study of Diverse Knowledge Sources and Smoothing Techniques via Maximum Entropy for Polyphone Disambiguation in Mandarin TTS Systems

机译:基于最大熵的普通话TTS系统中多音歧义消歧的各种知识源和平滑技术的比较研究

获取原文

摘要

This paper comparatively evaluated various knowledge sources and smoothing algorithms for pronunciation disambiguation in Mandarin TTS systems under maximum entropy (maxent) framework. In particular, five kinds of knowledge sources, namely characters and their pronunciations,words, their pronunciations and part-of-speech, together with two smoothing algorithms,I.e. Gaussian prior and inequality were compared. In our experiments conducted on 107 key Chinese polyphones, we found that all the knowledge sources almost perform equally well given the same smoothing measure,but the character-based features compare favorably because they are language independent and can be obtained with the lowest computation cost. Compared with the widely-used Gaussian smoothing, the equality smoothing greatly reduces the number of active features and yields a slightly improved accuracy on each knowledge source. Our best result (96.36%) is achieved by using character-based features together with the inequality smoothing, significantly superior to 81.22% by selecting the most frequent pronunciations and 88.72% by dictionary look-up with the part-of-speech. We also compared the maxent classifier with the transform-based error-driven learning algorithm (E.Brill, 1995) using the same knowledge sources,the results show that the maxent classifier achieve better performance to solve the polyphone disambiguation.
机译:本文比较了最大熵(maxent)框架下普通话TTS系统语音歧义消除的各种知识来源和平滑算法。尤其是五种知识源,即字符及其发音,单词,语音和词性以及两种平滑算法,即比较了高斯先验和不等式。在我们对107台中文按键式录音机进行的实验中,我们发现,在使用相同的平滑措施的情况下,所有知识源几乎都表现良好,但是基于字符的功能却具有优势,因为它们与语言无关,并且可以以最低的计算成本获得。与广泛使用的高斯平滑相比,等式平滑大大减少了活动特征的数量,并且在每个知识源上的准确性都有所提高。通过使用基于字符的功能和不等式平滑,可以达到最佳效果(96.36%),通过选择最常用的发音明显优于81.22%,通过使用词性的字典查找可以明显胜过88.72%。我们还使用相同的知识来源,将maxent分类器与基于变换的错误驱动学习算法(E.Brill,1995)进行了比较,结果表明maxent分类器能够更好地解决多音素歧义消除问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号