首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Letter-To-Phoneme Conversion based on Two-Stage Neural Network focusing on Letter and Phoneme Contexts
【24h】

Letter-To-Phoneme Conversion based on Two-Stage Neural Network focusing on Letter and Phoneme Contexts

机译:基于两阶段神经网络的字母到音素转换,重点是字母和音素上下文

获取原文

摘要

The improvement of Letter-To-Phoneme (L2P) conversion that can output the phoneme strings corresponding to Out-Of-Vocabulary (OOV) words, especially in English language, has become one of the most important issues in Text-To-Speech (TTS) research. In this paper, we propose a Two-Stage Neural Network (NN) based approach to solve the problem of conflicting output at a phonemic level. Both Letter and Phoneme Context-Dependent models are combined and implemented in the first-stage NN to convert several letters into several phonemes. Then, the second-stage NN can predict the final output phoneme by observing on a combination of several consecutive phoneme sequences that obtained from the first-stage NN. Therefore, our L2P conversion module takes a sequence of letters as input and outputs only one phoneme at each time. By focusing mainly on the result of word accuracy of OOV words, this new approach usually provides a higher performance.
机译:字母到音素(L2P)转换的改进,可以输出与词汇外(OOV)单词相对应的音素字符串,尤其是英语,这已成为“文本到语音”中最重要的问题之一( TTS)研究。在本文中,我们提出了一种基于两阶段神经网络(NN)的方法,以解决音素级别的输出冲突问题。字母和音素上下文相关模型都在第一阶段的NN中组合并实现,以将多个字母转换为多个音素。然后,第二阶段NN可以通过观察从第一阶段NN获得的几个连续音素序列的组合来预测最终的输出音素。因此,我们的L2P转换模块将一系列字母作为输入,并且每次仅输出一个音素。通过主要关注OOV单词的单词准确性的结果,这种新方法通常可提供更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号