首页> 外文会议>International Conference on speech and computer >Automatic Morphological Annotation in a Text-to-Speech System for Hebrew
【24h】

Automatic Morphological Annotation in a Text-to-Speech System for Hebrew

机译:希伯来语文本到语音系统中的自动形态注释

获取原文

摘要

The paper presents the module for automatic morphological annotation within a text synthesizer for Hebrew, based on an efficient combination of two approaches. The first approach includes the selection of lexemes from appropriate lexica, while the other approach involves automatic morphological analysis of text input using a complex expert algorithm relying on a set of transformational rules and using 6 types of scoring procedures. The module operates on a set of 30 part-of-speech tags with more than 3000 corresponding morphological categories. The paper discusses the advantages of the proposed method in the context of an extremely morphologically complex language such as Hebrew, with particular emphasis given to the relative importance of individual scoring procedures. When all 6 scoring procedures are applied, the accuracy of 99.6% is achieved on a corpus of 3093 sentences (55046 words).
机译:本文基于两种方法的有效结合,提出了希伯来语文本合成器中用于自动形态注释的模块。第一种方法包括从适当的词典中选择词素,而另一种方法包括使用复杂的专家算法(依赖一组转换规则并使用6种评分程序)对文本输入进行自动形态分析。该模块在30个词性标签集上运行,具有3000多个相应的形态学类别。本文讨论了在极复杂的形态学语言(例如希伯来语)的情况下所提出的方法的优势,并特别强调了单个计分程序的相对重要性。当应用所有6个计分程序时,对3093个句子(55046个单词)的语料库的准确率达到99.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号