首页> 外文会议>Association for Computational Linguistics Annual Meeting >Hybrid Methods for POS Guessing of Chinese Unknown Words
【24h】

Hybrid Methods for POS Guessing of Chinese Unknown Words

机译:POS猜测中文未知词的混合方法

获取原文

摘要

This paper describes a hybrid model that combines a rule-based model with two statistical models for the task of POS guessing of Chinese unknown words. The rule-based model is sensitive to the type, length, and internal structure of unknown words, and the two statistical models utilize contextual information and the likelihood for a character to appear in a particular position of words of a particular length and POS category. By combining models that use different sources of information, the hybrid model achieves a precision of 89%, a significant improvement over the best result reported in previous studies, which was 69%.
机译:本文介绍了一种混合模型,它与基于规则的模型与两个统计模型相结合,为POS猜测中文未知词的任务。基于规则的模型对未知单词的类型,长度和内部结构敏感,并且两个统计模型利用上下文信息和字符的可能性出现在特定长度和POS类别的单词的特定位置。通过组合使用不同信息来源的模型,混合模型达到了89%的精度,对以往的研究中报告的最佳结果的显着改善,这是69%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号