首页> 外文会议>International Applied Mechanics, Mechatronics Automation System Simulation Meeting >Discriminate Chinese Word Segmenter with Global and Context Features
【24h】

Discriminate Chinese Word Segmenter with Global and Context Features

机译:以全局和上下文特征识别中文字分段器

获取原文

摘要

Chinese Word segmenter is the basis for all subsequent applications of natural language processing. The Corpus-based statistic method has become the predominant method. However, the training corpora are not enough especially in certain areas. Therefore, we introduce some global features and context features in order to get almost the same performance only with much smaller scale corpus. The experiments results show that our approach significantly outperforms the original feature sets in the same training data. Meanwhile, the time-consuming of model training is also reduced. In addition, these features do not depend on classifiers, so our method can easily be changed to other models.
机译:中文字段器是自然语言处理的所有后续应用的基础。基于语料库的统计方法已成为主要方法。但是,培训小组尤其是在某些领域不够。因此,我们介绍了一些全局功能和上下文功能,以便仅具有更小的规模语料库来获得几乎相同的性能。实验结果表明,我们的方法显着优于同一训练数据中的原始功能集。同时,模型培训的耗时也减少了。此外,这些功能不依赖于分类器,因此我们的方法可以很容易地更改为其他模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号