首页> 外文会议>IASTED International Conference on Modelling, Identification and Control >Automatic handling of unseen contexts using a phoneme similarity matrix and its application to text-to-speech synthesis
【24h】

Automatic handling of unseen contexts using a phoneme similarity matrix and its application to text-to-speech synthesis

机译:使用音素相似矩阵自动处理看不见的上下文及其在文本到语音合成中的应用

获取原文
获取外文期刊封面目录资料

摘要

This paper presents a new method for automatic selection of optimal context from a database for an unseen phoneme sequence. If the context is not available for a test phoneme a novel formulation assigns a score to each of the training database phonemes in terms of their context. Normally, a decision tree is used for handling unseen phonemes in context[1,2]. However, this requires building a decision tree for each new language encountered. This may be problematic when developing multi-lingual speech processing systems. In addition, the tree structure may be quite different depending on the language. The proposed formulation incorporates a phoneme similarity matrix which is derived using an acoustic distance measure. This method is applied to selection of best units in a concatenative speech synthesis system, and encouraging results are obtained.
机译:本文介绍了一种新方法,用于从数据库中自动选择不间断的音素序列的最佳上下文。如果上下文不适用于测试音素,则新颖的制定在其上下文中为每个培训数据库音素分配分数。通常,决策树用于处理上下文中的看不见的音素[1,2]。但是,这需要为遇到的每种新语言构建决策树。当开发多语言语音处理系统时,这可能是有问题的。此外,根据语言,树结构可能完全不同。所提出的制剂包括使用声学距离测量来导出的音素相似性矩阵。该方法应用于在连接语音合成系统中的最佳单元的选择,并获得了令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号