首页> 外文会议>International Conference on Automation, Electronics and Electrical Engineering >Chinese Text Error Correction Method Based on Prefix Tree Merging
【24h】

Chinese Text Error Correction Method Based on Prefix Tree Merging

机译:基于前缀树合并的中文文本纠错方法

获取原文

摘要

In order to solve the problem of high computational complexity and repetitive calculation of the Long Short-Term Memory (LSTM) language model in the task of Chinese text automatic proofreading, a Chinese text error correction method based on prefix tree merging is proposed in this paper. The method has made the following improvements: different from the traditional error correction method based on N-gram model, the method use LSTM language model to evaluate the rationality of the candidate sentences, and the candidate sentences with higher similarity are combined into a tree structure and then scored. Repetitive calculations can be reduced by merging the same calculations when the model calculates the probability of candidate sentences, which thereby could improve the calculation efficiency of the LSTM language model. The experimental results show that this method can not only achieve good error correction accuracy, but also shorten the time consumed and improve the error correction efficiency.
机译:为了解决高计算复杂性的问题和重复计算的长短期内存(LSTM)语言模型在中文文本自动校对任务中,本文提出了一种基于前缀树合并的中文纠错方法。该方法已经提出了以下改进:与基于N-GRAM模型的传统纠错方法不同,方法使用LSTM语言模型来评估候选句子的合理性,并且较高相似性的候选句子组合成树结构然后得分。通过合并模型计算候选句子的概率时,可以通过合并相同的计算来减少重复计算,从而可以提高LSTM语言模型的计算效率。实验结果表明,这种方法不仅可以达到良好的纠错精度,还可以缩短耗时和提高纠错效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号