首页> 外文会议>CIPS-SIGHAN joint conference on Chinese language processing >Chinese Spelling Check System Based on Tri-gram Model
【24h】

Chinese Spelling Check System Based on Tri-gram Model

机译:基于三克模型的汉语拼写检查系统

获取原文

摘要

This paper describes our system in the Chinese spelling check (CSC) task of CLP-SIGHAN Bake-Off 2014. CSC is still an open problem today. To the best of our knowledge, n-gram language modeling (LM) is widely used in CSC because of its simplicity and fair predictive power. Our work in this paper continues this general line of research by using a tri-gram LM to detect and correct possible spelling errors. In addition, we use dynamic programming to improve the efficiency of the algorithm, and additive smoothing to solve the data sparseness problem in training set. Empirical evaluation results demonstrate the utility of our CSC system.
机译:本文介绍了我们在2014年CLP-Sighan烘焙的中文拼写检查(CSC)任务中的系统。CSC今天仍然是一个开放问题。据我们所知,N-GRAM语言建模(LM)广泛用于CSC,因为其简单和公平预测力。我们本文的工作继续通过使用三克LM来检测和纠正可能的拼写错误。此外,我们使用动态编程来提高算法的效率,以及附加平滑,以解决训练集中的数据稀疏问题。经验评估结果证明了CSC系统的效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号