首页> 外文会议> >New methods for detecting characters wrongly deleted and inserted in Japanese strings and their applicability to DNA chains
【24h】

New methods for detecting characters wrongly deleted and inserted in Japanese strings and their applicability to DNA chains

机译:检测日语字符串中错误删除和插入的字符的新方法及其对DNA链的适用性

获取原文

摘要

This paper proposes methods to detect and to correct the characters wrongly inserted and deleted in natural language. Natural language is physically different from DNA, however it has a lot of common characteristics in point of medium representing information. Accordingly the methods proposed here are expected to be applied to detect errors in DNA chains. In optical character recognition and continuous speech recognition of a natural language, it has been difficult to detect error characters which are wrongly deleted and inserted. In order to detect and correct these errors, this paper proposes new methods using m-th order Markov chain model for Japanese syllables and "kanji-kana" characters, assuming that Markov probability of a correct chain of syllables or "kanji-kana" characters is greater than that of erroneous chains. From the results of the experiments, it is concluded that the method is useful for detecting as well as correcting these errors.
机译:本文提出了检测和纠正自然语言中错误插入和删除的字符的方法。自然语言在物理上与DNA不同,但是就代表信息的媒介而言,它具有许多共同的特征。因此,预期本文提出的方法将用于检测DNA链中的错误。在自然语言的光学字符识别和连续语音识别中,难以检测被错误删除和插入的错误字符。为了检测和纠正这些错误,本文提出了一种使用m阶马尔可夫链模型的日语音节和“汉字假名”字符的新方法,并假设正确的音节链或“汉字假名”字符的马尔可夫概率比错误链更大。从实验结果可以得出结论,该方法对于检测和校正这些误差是有用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号