首页> 外文期刊>ACM transactions on Asian language information processing >Chinese Spelling Checker Based on an Inverted Index List with a Rescoring Mechanism
【24h】

Chinese Spelling Checker Based on an Inverted Index List with a Rescoring Mechanism

机译:基于倒排索引的带有评分机制的中文拼写检查器

获取原文
获取原文并翻译 | 示例

摘要

An approach is proposed for Chinese spelling error detection and correction, in which an inverted index list with a rescoring mechanism is used. The inverted index list is a structure for mapping from word to desired sentence, and for representing nodes in lattices constructed through character expansion (according to predefined phonologically and visually similar character sets). Pruning based on a contextual dependency confidence measure was used to markedly reduce the search space and computational complexity. Relevant mapping relations between the original input and desired input were obtained using a scoring mechanism composed of class-based language and maximum entropy correction models containing character, word, and contextual features. The proposed method was evaluated using data sets provided by SigHan 7 bakeoff. The experimental results show that the proposed method achieved acceptable performance in terms of recall rate or precision rate in error sentence detection and error location detection, and it outperformed other approaches in error location detection and correction.
机译:提出了一种中文拼写错误检测与纠正的方法,该方法采用了具有评分机制的倒排索引表。倒排索引列表是一种结构,用于从单词映射到所需句子,并表示通过字符扩展(根据预定义的语音和视觉相似字符集)构造的网格中的节点。基于上下文相关性置信度度量的修剪可显着减少搜索空间和计算复杂性。使用基于类的语言和包含字符,单词和上下文特征的最大熵校正模型组成的评分机制,可以获取原始输入和所需输入之间的相关映射关系。使用SigHan 7 bakeoff提供的数据集对提出的方法进行了评估。实验结果表明,该方法在错误句检测和错误位置检测中的查全率或查准率均达到了可接受的性能,在错误位置检测和纠正中优于其他方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号