SYSTEM AND METHOD FOR SPELLING CORRECTION OF NON-ROMAN CHARACTER AND WORD

Abstract

PROBLEM TO BE SOLVED: To provide a system and a method for processing and correcting misspellings of words written in non-Roman characters, such as Chinese, Japanese, and Korean, using rule-based classifiers and hidden Markov models.

SOLUTION: The method includes the steps of: converting an input entry in a first language, such as Chinese, into at least one intermediate entry in an intermediate representation, such as Pinyin, that differs from the first language; converting the intermediate entry into at least one possible alternative spelling or form of the input entry in the first language; and determining that the input entry is an accurate input entry when a match between the input entry and its possible alternative spellings is found, or a doubtful input entry when no match is found. A doubtful input entry can then be classified by classifiers based on conversion rules generated by, for example, a conversion rule generator.
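The round trip described in the abstract (first language to Pinyin, Pinyin back to candidate spellings, then a match test) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the two toy lookup tables `CHAR_TO_PINYIN` and `PINYIN_TO_WORDS`, and the helper names `to_intermediate`, `alternatives`, and `check`, do not come from the patent. The later classification of doubtful entries with rule-based classifiers or hidden Markov models is not modelled.

```python
from itertools import product

# Hypothetical toy data; a real system would use a full lexicon.
CHAR_TO_PINYIN = {
    "北": ["bei"],
    "背": ["bei"],
    "京": ["jing"],
    "景": ["jing"],
}

# Known words indexed by their Pinyin syllables (assumed toy lexicon).
PINYIN_TO_WORDS = {
    ("bei", "jing"): ["北京", "背景"],
}


def to_intermediate(entry):
    """Return every Pinyin reading of the entry as a tuple of syllables."""
    per_char = [CHAR_TO_PINYIN.get(ch, []) for ch in entry]
    # A character with no known reading yields no readings for the entry.
    return [tuple(reading) for reading in product(*per_char)]


def alternatives(entry):
    """Convert each Pinyin reading back into known first-language words."""
    found = []
    for reading in to_intermediate(entry):
        for word in PINYIN_TO_WORDS.get(reading, []):
            if word not in found:
                found.append(word)
    return found


def check(entry):
    """Label the entry 'accurate' if it matches an alternative, else 'doubtful'."""
    alts = alternatives(entry)
    status = "accurate" if entry in alts else "doubtful"
    return status, alts
```

With these toy tables, `check("北京")` labels the entry accurate because the entry reappears among its own alternatives, while `check("北景")` labels it doubtful and returns the same two candidates, which a later classification stage could then rank.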
