Journal: 《中国电子杂志(英文版)》 (Chinese Journal of Electronics, English edition)

An Iterative Method for Extracting Chinese Unknown Words

         

Abstract

This paper proposes an iterative method for extracting unknown words from a Chinese text corpus. Traditional non-iterative segmentation-detection approaches use only known words for segmentation. The proposed method instead extracts new words iteratively and adds them to the lexicon. The augmented dictionary, which includes both known words and potential unknown words, is then used in the next iteration to re-segment the input corpus. Experiments show that both the precision and the recall of segmentation improve.
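The loop described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which segmenter or which detection criterion the authors use, so everything concrete here is an assumption made for illustration: forward maximum matching stands in for the segmenter, a simple frequency threshold on adjacent token pairs stands in for unknown-word detection, and the names `segment`, `propose_candidates`, `iterative_extract`, `max_len`, `min_count`, and `max_iter` are hypothetical.

```python
from collections import Counter


def segment(text, lexicon, max_len=4):
    """Forward maximum matching (an assumed stand-in segmenter).

    At each position, take the longest lexicon entry of at most max_len
    characters; if nothing matches, emit a single character.
    """
    tokens, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + n]
            if n == 1 or piece in lexicon:
                tokens.append(piece)
                i += n
                break
    return tokens


def propose_candidates(segmented, lexicon, min_count=2, max_len=4):
    """Assumed detection heuristic: merge adjacent token pairs in which at
    least one token is not in the lexicon, keeping pairs seen min_count+ times.
    """
    counts = Counter()
    for tokens in segmented:
        for a, b in zip(tokens, tokens[1:]):
            if (a not in lexicon or b not in lexicon) and len(a + b) <= max_len:
                counts[a + b] += 1
    return {word for word, count in counts.items() if count >= min_count}


def iterative_extract(corpus, known_words, max_iter=10, min_count=2, max_len=4):
    """Segment, detect candidates, augment the lexicon, and re-segment."""
    lexicon = set(known_words)
    added = set()
    for _ in range(max_iter):
        segmented = [segment(s, lexicon, max_len) for s in corpus]
        candidates = propose_candidates(segmented, lexicon, min_count, max_len)
        candidates -= lexicon
        if not candidates:
            break  # no new candidates: the lexicon has stabilised
        lexicon |= candidates
        added |= candidates
    # Final pass with the augmented lexicon.
    segmented = [segment(s, lexicon, max_len) for s in corpus]
    # Report only the added words that the final segmentation actually uses.
    used = {tok for tokens in segmented for tok in tokens}
    return added & used, segmented
```

On a toy corpus of 我爱北京天安门, 天安门在北京, and 我在北京 with the seed lexicon {北京, 我, 爱, 在}, this sketch first adds the fragments 天安 and 安门, then merges 天安 with 门 in the second pass, and ends with 天安门 as the only added word still used in the final segmentation. It shows why re-segmenting with the augmented dictionary matters: the three-character word only becomes detectable after the first pass has added its two-character prefix.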
