首页> 外文会议>CIPS-SIGHAN joint conference on Chinese language processing >Introduction to NJUPT Chinese Spelling Check Systems in CLP-2014 Bakeoff
【24h】

Introduction to NJUPT Chinese Spelling Check Systems in CLP-2014 Bakeoff

机译:在CLP-2014 BAKEOFF中的Njupt中文拼写检查系统简介

获取原文
获取外文期刊封面目录资料

摘要

Chinese spelling check (CSC) is an essential issue in the research field of Chinese language processing (CLP). This paper describes the details of two CSC systems we developed to solve this problem. The first system was built based on CRF model, and the modules of such system include word segmentation, error detection and error correction. Another system was based on 2-Chars&&3-Chars model, and its modules include bigram segmentation, error detection and error correction. Using the final test data set provided by CLP2014, the final experimental result of the system based on 2-Chars&&3-Chars model was better, which achieved 0.403 detection accuracy with 0.3344 detection precision and 0.3964 correction accuracy with 0.3191 correction precision.
机译:中国拼写检查(CSC)是汉语处理研究领域的重要问题(CLP)。本文介绍了我们开发出解决此问题的两个CSC系统的细节。第一个系统是基于CRF模型构建的,并且这种系统的模块包括字分割,错误检测和纠错。另一个系统基于2字符&& 3字符模型,其模块包括Bigram分段,错误检测和纠错。使用CLP2014提供的最终测试数据集,基于2-CHAR && 3-CHAR模型的系统的最终实验结果更好,其达到0.403检测精度,检测精度为0.3344检测精度,0.3964校正精度为0.3191校正精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号