首页> 中文期刊>现代电子技术 >中文分词歧义识别算法的优化

中文分词歧义识别算法的优化

     

摘要

The performance of Chinese word segmentation system directly influences the subsequent work, in which the ambiguity words should be recognized and processed accurately. The processing effect is a very important sign of measuring a segmentation system. In order to solve the ambiguity problem, the ambiguity words have to be found first. An algorithm combining literal scanning algorithm with reverse maximum matching algorithm is proposed on the basis of increasing maximum matching algorithm. It can be proved that the efficiency of this algorithm is better than the increasing maximum matching algorithm.%中文分词系统性能的好坏直接影响到后续的工作,而歧义字段的处理更是衡量一个分词系统好坏的重要标志.解决歧义问题前首先就要找到歧义字段,本文在之前的增字最大匹配算法基础上,提出了一种结合逐字扫描算法和逆向最大匹配算法的歧义字段识别方法.实验结果表明,这里提出的算法执行效率要比增字最大匹配算法效率高,速度更快.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号