Letter-To-Phoneme Conversion based on Two-Stage Neural Network focusing on Letter and Phoneme Contexts

机译：基于两阶段神经网络的字母到音素转换，重点是字母和音素上下文

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The improvement of Letter-To-Phoneme (L2P) conversion that can output the phoneme strings corresponding to Out-Of-Vocabulary (OOV) words, especially in English language, has become one of the most important issues in Text-To-Speech (TTS) research. In this paper, we propose a Two-Stage Neural Network (NN) based approach to solve the problem of conflicting output at a phonemic level. Both Letter and Phoneme Context-Dependent models are combined and implemented in the first-stage NN to convert several letters into several phonemes. Then, the second-stage NN can predict the final output phoneme by observing on a combination of several consecutive phoneme sequences that obtained from the first-stage NN. Therefore, our L2P conversion module takes a sequence of letters as input and outputs only one phoneme at each time. By focusing mainly on the result of word accuracy of OOV words, this new approach usually provides a higher performance.

机译：字母到音素（L2P）转换的改进，可以输出与词汇外（OOV）单词相对应的音素字符串，尤其是英语，这已成为“文本到语音”中最重要的问题之一（ TTS）研究。在本文中，我们提出了一种基于两阶段神经网络（NN）的方法，以解决音素级别的输出冲突问题。字母和音素上下文相关模型都在第一阶段的NN中组合并实现，以将多个字母转换为多个音素。然后，第二阶段NN可以通过观察从第一阶段NN获得的几个连续音素序列的组合来预测最终的输出音素。因此，我们的L2P转换模块将一系列字母作为输入，并且每次仅输出一个音素。通过主要关注OOV单词的单词准确性的结果，这种新方法通常可提供更高的性能。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.1896-1899|共4页
会议地点
作者
Kheang Seng; Yurie Iribe; Tsuneo Nitta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
letter context-dependent model; phoneme context-dependent model; phoneme sequences pattern- observation model; many-to-many alignment and two-stage neural network based approach;

机译：字母上下文相关模型音素上下文相关模型;音素序列模式观察模型;基于多对多对齐和两阶段神经网络的方法;
入库时间 2022-08-26 15:06:06

相似文献

外文文献
中文文献
专利

1. Improving the performance of Letter-To-Phoneme conversion by using Two-Stage Neural Network [J] . KHEANG SENG, KOUICHIKATSURADA, YURIE IRIBE, 電子情報通信学会技術研究報告 . 2012,第369期

机译：通过使用两阶段神经网络提高字母到音素转换的性能
2. Improving the performance of Letter-To-Phoneme conversion by using Two-Stage Neural Network [J] . KHEANG SENG, KOUICHI KATSURADA, YURIE IRIBE, 電子情報通信学会技術研究報告. 音声. Speech . 2012,第369期

机译：通过使用两阶段神经网络提高字母到音素转换的性能
3. Solving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach [J] . Seng KHEANG, Kouichi KATSURADA, Yurie IRIBE, IEICE transactions on information and systems . 2014,第4期

机译：使用基于两阶段神经网络的方法解决音素到音素转换中的音素冲突
4. A Hybride Approach For Grapheme-to-Phoneme Conversion Based o na Combination of Partial String matching and A Neural Network [C] . Horst-Udo Hain 6th International conference on Spoken Language Processing ICSLP 2000 Oct. 16-Oct.20 2000 Beijing International Convention Center, Beijing, China . 2000

机译：基于部分字符串匹配和神经网络组合的音素到音素转换的混合方法
5. Diagnosis and prognosis of electrical and mechanical faults using wireless sensor networks and two-stage neural network classifier. [D] . Ramani, Akarsha. 2008

机译：使用无线传感器网络和两阶段神经网络分类器对机电故障进行诊断和预后。
6. Toward an Executive Origin for Acquired Phonological Dyslexia: A Case of Specific Deficit of Context-Sensitive Grapheme-to-Phoneme Conversion Rules [O] . Noémie Auclair-Ouellet, Marion Fossard, Marie-Catherine St-Pierre, 2013

机译：迈向获得性语音阅读障碍的行政起源：上下文敏感的音素到音素转换规则的特定缺陷案例
7. Solving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach [O] . Seng KHEANG, Kouichi KATSURADA, Yurie IRIBE, 2014

机译：使用基于两阶段的神经网络的方法解决标记到音素转换的音素冲突

Letter-To-Phoneme Conversion based on Two-Stage Neural Network focusing on Letter and Phoneme Contexts

摘要

著录项

相似文献

相关主题

期刊订阅