首页> 外文会议>International Conference on Electrical Engineering and Informatics >A non word error spell checker for Indonesian using morphologically analyzer and HMM
【24h】

A non word error spell checker for Indonesian using morphologically analyzer and HMM

机译:使用形态学分析仪和肝脏的印度尼西亚的非单词错误拼写检查

获取原文

摘要

Spell checker consists of two main methods, error detection and error correction. In this study, spell checker is built by using morphological analyzer and dictionary lookup as error detection method with two alternative optimization, binary search and hash. Whilst as for error correction, two alternative methods, namely forward reversed dictionary and probability of similarity is used. Forward reversed dictionary corrects the misspelled word by considering edit distance between the misspelled word and its candidates. Probability of similarity, which is the main proposed method for error correction, correct the misspelled word by calculating its similarity to a candidate word, based on the value of optimum subsequence between them. Candidate sorting was accomplished through the use of HMM (Hidden Markov Model), where the word is considered as observed state and the candidates as hidden state. By using HMM, the system does not only consider the similarity of the candidate word with misspelled words, but also consider the sequence of words in sentences where the word is located. The experiment result proves that sorting candidates by using HMM increase the precision accuracy. As for correction method, the result showed that using probability of similarity has better correctness accuracy than forward reversed dictionary.
机译:拼写检查包括两种主要方法,错误检测和纠错。在本研究中,使用形态分析仪和字典查找作为错误检测方法构建拼写检查,具有两个替代优化,二进制搜索和哈希。虽然对于纠错,但使用了两个替代方法,即前向反转字典和相似度的概率。前向颠倒的字典通过考虑拼写错误的单词及其候选人之间的编辑距离来纠正拼写错误的单词。相似性的概率,这是纠错的主要提出方法,通过基于它们之间的最佳子序列的值来计算其与候选词的相似性来纠正拼错单词。通过使用HMM(隐马尔可夫模型)来完成候选分类,其中单词被认为是观察状态和候选作为隐藏状态。通过使用嗯,系统不仅考虑候选词与拼写错误的单词的相似性,还考虑单词所在的句子中的单词序列。实验结果证明了通过使用HMM进行分类候选,提高了精度精度。对于校正方法,结果表明,使用相似性的概率具有比向前反向字典的更好的正确性精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号