首页> 外文会议>Advances in Natural Language Processing >A POS-Based Word Prediction System for the Persian Language
【24h】

A POS-Based Word Prediction System for the Persian Language

机译:基于POS的波斯语单词预测系统

获取原文
获取原文并翻译 | 示例

摘要

Word prediction is the problem of guessing the words which are likely to follow in a given text segment by displaying a list of the most probable words that could appear in that position. In this research, we designed and implemented three word predictors for Persian. Our baseline is a statistical-based system which uses language models. The first system uses word statistics; in the second one we use the main syntactic categories of a Persian POS tagged corpus; and the last one uses the main syntactic categories along with their morphological, syntactic and semantic subcategories. Using Keystroke Saving (KSS) as the most important metrics to evaluate systems' performance, the primary word-based statistical system achieved 37% KSS, and the second system that used only the main syntactic categories with word-statistics achieved 38.95% KSS. Our last system which used all of the available information to the words get the best result by 42.45% KSS.
机译:单词预测是通过显示可能出现在该位置的最可能单词的列表来猜测在给定文本段中可能跟随的单词的问题。在这项研究中,我们为波斯语设计并实现了三个单词预测器。我们的基准是使用语言模型的基于统计的系统。第一个系统使用单词统计;在第二篇文章中,我们使用波斯语POS标记语料库的主要句法类别;最后一个使用主要的句法类别以及它们的形态,句法和语义子类别。使用击键节省(KSS)作为评估系统性能的最重要指标,基于单词的主要统计系统达到了37%的KSS,而仅使用主要句法类别进行单词统计的第二个系统达到了38.95%的KSS。我们的最后一个系统使用所有可用信息作为单词,从而以42.45%KSS获得最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号