首页> 外文会议> >Automatically acquiring part of speech correcting rules of multi-category words based on incomplete decision tables
【24h】

Automatically acquiring part of speech correcting rules of multi-category words based on incomplete decision tables

机译:基于不完整决策表自动获取多类别词的部分词性纠正规则

获取原文

摘要

Part of speech (POS) tagging is a basic subject for Chinese information processing. In general, the existence of multi-category words greatly affects the processing quality of corpora. High efficient methods and automatically correcting techniques for multi-category word tagging are the keys for improving tagging precision. In this paper, for part of speech correcting of multi-category word, a modeling method is introduced based on an incomplete decision table and two algorithms for attribute reduction and object reduction used for automatically acquiring correcting rules are presented based on attribute significance. The results of testing show the validity of our method for improving part of speech tagging precision in large corpora engineering.
机译:词性(POS)标记是中文信息处理的基本主题。通常,多类别词的存在极大地影响了语料库的处理质量。用于多类别词标记的高效方法和自动更正技术是提高标记精度的关键。本文针对多类别词的语音校正,提出了一种基于不完备决策表的建模方法,提出了两种基于属性重要性的属性约简和对象约简算法,用于自动获取校正规则。测试结果表明,该方法在大型语料库工程中提高部分语音标注精度的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号