Conference: IASTED International Conference on Modelling, Identification and Control

Automatic handling of unseen contexts using a phoneme similarity matrix and its application to text-to-speech synthesis




This paper presents a new method for automatically selecting the optimal context from a database for an unseen phoneme sequence. If the context of a test phoneme is not available, a novel formulation assigns a score to each phoneme in the training database according to its context. Normally, a decision tree is used to handle unseen phonemes in context [1,2]. However, this requires building a decision tree for each new language encountered, which may be problematic when developing multilingual speech processing systems. In addition, the tree structure may differ considerably from one language to another. The proposed formulation incorporates a phoneme similarity matrix derived using an acoustic distance measure. The method is applied to selecting the best units in a concatenative speech synthesis system, and encouraging results are obtained.
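
The abstract does not give the scoring formula, so the following is only a minimal sketch of how a similarity-based context score might be used to pick a unit for an unseen context. Everything in it is an assumption made for illustration: the triphone representation, the names `SIMILARITY`, `similarity`, `context_score`, and `best_unit`, the equal weighting of left and right context, and the similarity values themselves (in the paper these would come from an acoustic distance measure).

```python
from typing import Dict, List, Tuple

# A unit is a (left context, phoneme, right context) triple.
Triphone = Tuple[str, str, str]

# Hypothetical symmetric similarity values in [0, 1].
# These numbers are invented for the example, not taken from the paper.
SIMILARITY: Dict[Tuple[str, str], float] = {
    ("p", "b"): 0.8,
    ("t", "d"): 0.8,
    ("p", "t"): 0.5,
    ("b", "d"): 0.5,
}


def similarity(a: str, b: str) -> float:
    """Look up similarity in either order; identical phonemes score 1.0."""
    if a == b:
        return 1.0
    return SIMILARITY.get((a, b), SIMILARITY.get((b, a), 0.0))


def context_score(target: Triphone, candidate: Triphone) -> float:
    """Score a database unit against the target context.

    Assumed formulation: the centre phoneme must match, and the score is
    the mean similarity of the left and right neighbours.
    """
    if target[1] != candidate[1]:
        return 0.0
    left = similarity(target[0], candidate[0])
    right = similarity(target[2], candidate[2])
    return 0.5 * (left + right)


def best_unit(target: Triphone, database: List[Triphone]) -> Triphone:
    """Return the database unit whose context scores highest."""
    return max(database, key=lambda unit: context_score(target, unit))


# Toy database that does not contain the target context ("b", "a", "t").
DATABASE: List[Triphone] = [("p", "a", "t"), ("s", "a", "d"), ("m", "a", "n")]
TARGET: Triphone = ("b", "a", "t")

if __name__ == "__main__":
    print(best_unit(TARGET, DATABASE))
```

With these toy values, the unit `("p", "a", "t")` is selected because its left neighbour is acoustically close to the target's and its right neighbour matches exactly. Unlike a decision tree, a scheme of this kind would need only a similarity matrix for each new language, which is the motivation the abstract gives.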


