首页> 外文会议>2011 Annual IEEE India Conference : Engineering Sustainable Solutions >Memory-based data-driven approach for grapheme-to-phoneme conversion in Bengali text-to-speech synthesis system
【24h】

Memory-based data-driven approach for grapheme-to-phoneme conversion in Bengali text-to-speech synthesis system

机译:孟加拉语文本到语音合成系统中基于内存的数据驱动的字素到音素转换方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we propose a memory-based data-driven model for grapheme-to-phoneme (G2P) conversion for Bengali text-to-speech synthesis (TTS) system. Previous studies have stated the significance of the linguistic and phonetic features for rule-based Bengali G2P conversion techniques. But due to the lack of proper morphological analyzer, the scope of rule-based approaches is bounded. The proposed method overcomes the limitation of rule-based methods by exploiting the variety of contexts present in the text corpus built in the current study. The model has been trained with a memory-base showing the relation between graphs and phones based on contexts. The model has been tested with 300 random words and it achieved accuracy of 79.33% at word-level and 96.28% at graph-level. This performance has been compared with a related rule-based approach to prove the effectiveness of a data-driven method. Furthermore, the model doesn't require any morphological knowledge of the words.
机译:在本文中,我们提出了一个基于内存的数据驱动模型,用于孟加拉语文本到语音合成(TTS)系统的字素到音素(G2P)转换。以前的研究已经指出了基于规则的孟加拉G2P转换技术的语言和语音功能的重要性。但是由于缺乏适当的形态分析器,基于规则的方法的范围受到限制。所提出的方法通过利用当前研究中建立的文本语料库中存在的各种上下文,克服了基于规则的方法的局限性。该模型已通过存储库进行了训练,该存储库基于上下文显示了图表和电话之间的关系。该模型已用300个随机单词进行了测试,在单词级别的准确性达到79.33%,在图形级别的准确性达到96.28%。将该性能与相关的基于规则的方法进行了比较,以证明数据驱动方法的有效性。此外,该模型不需要单词的任何形态学知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号