【24h】

Part of Speech Tagging for Romanian Text-to-Speech System

机译:罗马尼亚文本语音转换系统语音标记的一部分

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a Part of Speech (POS) tagger that has been developed for Romanian Text-to-Speech purposes. In our Text-to-Speech (TTS) system, the Part of Speech tagger is used to disambiguate the pronunciation of some homograph words, determine the semantic links between words, phrase breaks and intonation phrase boundaries and eventually design the intonation curves. The paper focuses on the development and evaluation of the Romanian POS tagger. The findings of this paper show that Naive Bayes models can very well be used for tagging in a hybrid system composed of trained statistical model and a word database. Our experimental results have uncovered an acceptable accuracy and real time performance of the integrated model using a reduced tag set.
机译:本文介绍了为罗马尼亚语文本到语音目的而开发的词性(POS)标记器。在我们的文本语音转换(TTS)系统中,语音部分标记器用于消除某些同形异义词的发音的歧义,确定单词之间的语义联系,短语中断和语调短语边界,并最终设计语调曲线。本文重点介绍罗马尼亚POS标记器的开发和评估。本文的发现表明,朴素贝叶斯模型可以很好地用于由训练统计模型和单词数据库组成的混合系统中的标记。我们的实验结果发现,使用减少的标签集,可以使集成模型具有可接受的准确性和实时性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号