首页> 外文会议>CIPS-SIGHAN joint conference on Chinese language processing >Word Segmenter for Chinese Micro-blogging Text Segmentation - Report for CIPS-SIGHAN'2014 Bakeoff
【24h】

Word Segmenter for Chinese Micro-blogging Text Segmentation - Report for CIPS-SIGHAN'2014 Bakeoff

机译:用于中文微博文本分割的分词器-CIPS-SIGHAN 2014总结报告

获取原文

摘要

This paper presents our system for the CIPS-SIGHAN-2014 bakeoff task of Chinese word segmentation. This system adopts a character-based joint approach, which combines a character-based generative model and a character-based discriminative model. To further improve the performance in cross-domain, an external dictionary is employed. In addition, pre-processing and post-processing rules are utilized to further improve the performance. The final performance on the test corpus shows that our system achieves comparable results with other state-of-the-art systems.
机译:本文介绍了用于CIPS-SIGHAN-2014汉字分词任务的系统。该系统采用基于字符的联合方法,该方法结合了基于字符的生成模型和基于字符的判别模型。为了进一步提高跨域性能,使用了外部字典。另外,利用预处理和后处理规则来进一步提高性能。测试语料库的最终性能表明,我们的系统可获得与其他最新系统相当的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号