Research of automatic Chinese word segmentation

Abstract

Automatic Chinese word segmentation is a fundamental task in Chinese information processing. At present, the segmentation of ambiguous phrases and the recognition of proper names are two obstacles to the performance of Chinese word segmentation systems. We apply a corpus-based method to extract various linguistic phenomena from real texts and combine a statistical model with rules for Chinese word segmentation. This increases segmentation precision by improving both the segmentation of ambiguous phrases and the recognition of unknown words. Finally, we describe a Chinese word segmentation system developed at Shanxi University.
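
The abstract does not specify the statistical model or the rules used, so the following is only an illustrative sketch of why segmentation ambiguity arises and how corpus statistics can help resolve it. It assumes a toy lexicon with invented word counts (`LEXICON`), a hypothetical penalty for unlisted characters (`UNKNOWN_LOGP`), and a simple unigram model; none of these come from the paper. A greedy longest-match rule is compared with a dynamic-programming search for the most probable split.

```python
import math

# Toy lexicon with invented counts (hypothetical, for illustration only).
LEXICON = {"研究": 50, "研究生": 20, "生命": 40, "命": 5, "起源": 30, "生": 5}
TOTAL = sum(LEXICON.values())
MAX_LEN = max(len(w) for w in LEXICON)
# Assumed penalty for a single character that is not in the lexicon.
UNKNOWN_LOGP = math.log(0.5 / TOTAL)


def forward_max_match(text):
    """Greedy rule: at each position take the longest lexicon word."""
    out, i = [], 0
    while i < len(text):
        for j in range(min(len(text), i + MAX_LEN), i, -1):
            # Fall back to a single character when nothing longer matches.
            if text[i:j] in LEXICON or j == i + 1:
                out.append(text[i:j])
                i = j
                break
    return out


def best_unigram_split(text):
    """Search all lexicon-based splits and keep the one with the highest
    summed unigram log-probability (dynamic programming)."""
    n = len(text)
    best = [(-math.inf, 0)] * (n + 1)  # (score, start index of last word)
    best[0] = (0.0, 0)
    for j in range(1, n + 1):
        for i in range(max(0, j - MAX_LEN), j):
            word = text[i:j]
            if word in LEXICON:
                logp = math.log(LEXICON[word] / TOTAL)
            elif j - i == 1:
                logp = UNKNOWN_LOGP
            else:
                continue
            score = best[i][0] + logp
            if score > best[j][0]:
                best[j] = (score, i)
    # Walk the back-pointers from the end of the text to recover the words.
    words, j = [], n
    while j > 0:
        i = best[j][1]
        words.append(text[i:j])
        j = i
    return words[::-1]
```

With these invented counts, `forward_max_match("研究生命起源")` returns `['研究生', '命', '起源']`, while `best_unigram_split("研究生命起源")` returns `['研究', '生命', '起源']`, the reading a human would choose. Unknown words such as proper names are handled here only by a crude per-character penalty; a real system would need dedicated recognition for them.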