首页> 外文会议>2nd Conference on Speech Technology and Human-Computer Dialogue; Apr 10-11, 2003; Bucharest >AN ANN-BASED METHOD TO IMPROVE THE PHONETIC TRANSCRIPTION AND PROSODY MODULES OF A TTS SYSTEM FOR THE ROMANIAN LANGUAGE
【24h】

AN ANN-BASED METHOD TO IMPROVE THE PHONETIC TRANSCRIPTION AND PROSODY MODULES OF A TTS SYSTEM FOR THE ROMANIAN LANGUAGE

机译:基于人工神经网络的罗马尼亚语TTS系统语音翻译和韵律模块的改进

获取原文
获取原文并翻译 | 示例

摘要

High quality Text-to-Speech applications require input text pre-processing in order to provide the speech synthesizer with reliable information regarding both phoneme parameters and prosody. Part of this mandatory pre-processing refers to phonetic transcription, syllable delimitation and stressed syllable identification in words. The translation of letters (or graphemes) strings into sequences of phonemes may be done almost entirely by rule-based methods, built on knowledge extracted from phonetic and phonologic rules, because the Romanian language has a phonetic character. Nevertheless, there may exist situations that cannot be entirely characterized by explicit rules, namely those of the two/three consecutive vowels pronounced either in a diphthong/triphthong, or in a hiatus. We describe our approach for Romanian phonetic transcription and stress assignation by use of two ANN sub-modules. The first sub-module implements the word hyphenation by an automatic learning method, solving the decision diphthong/triphthong versus hiatus (including semivowels identification), also covering the exceptions to existing syllabication rules. The output supplies information to the Letter-to-Sound module, for completing the phonetic transcription. The second ANN sub-module implements a solution for stressed syllable identification, as required in the prosody module.
机译:高质量的文本到语音应用程序需要对输入文本进行预处理,以便为语音合成器提供有关音素参数和韵律的可靠信息。这种强制性预处理的一部分涉及语音转录,音节定界和单词的重音节识别。将字母(或字素)字符串转换成音素序列的过程几乎可以完全通过基于规则的方法完成,该方法基于从语音和语音规则中提取的知识,因为罗马尼亚语具有语音特性。然而,可能存在无法完全由显式规则表征的情况,即以二重音/三重音或中断音发音的两个/三个连续元音的情况。我们通过两个ANN子模块描述了罗马尼亚语音转录和重音分配的方法。第一个子模块通过自动学习方法实现单词连字,解决了双音/三音与断音的决策(包括半元音识别),还涵盖了现有音节规则的例外。输出将信息提供给Letter-to-Sound模块,以完成语音转录。第二个ANN子模块根据韵律模块的要求,实现了用于重读音节识别的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号