首页> 外国专利> System and method for spelling correction of non-Roman letters and words

System and method for spelling correction of non-Roman letters and words

机译:非罗马字母和单词的拼写校正系统和方法

摘要

Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
机译:公开了使用基于规则的分类器和隐马尔可夫模型来处理和纠正诸如中文,日文和韩文的非罗马词的拼写错误的系统和方法。该方法通常包括:将第一语言(例如中文)的输入条目转换为中间表示形式(例如拼音)中的至少一个不同于第一语言的中间条目,将中间条目转换为至少一种可能的替代拼写或形式以第一语言输入,并且当输入条目与输入条目的所有可能替代拼写之间的匹配项分别位于或未找到时,确定输入条目是正确的还是可疑的输入条目。可以使用例如基于变换规则生成器生成的变换规则的基于变换规则的分类器来对可疑输入条目进行分类。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号