Locating digital coded words which are both acceptable misspellings and acceptable inflections of digital coded query words

Abstract

A method is disclosed that uses a digital data processing means to determine, from a plurality of candidate words, at least one that is both an acceptable spelling and an acceptable inflection of a query word. The words are represented by machine-readable coded signals and comprise plural characters. The steps are as follows. Determine a stem portion of the query word. Form a suffix class indication for any one of a plurality of classes in which the query word may be included. Compare the determined query stem with the characters at the beginning of the candidate words to find acceptable and nonacceptable spelling matches. Determine the ending portion, if any, of each individual candidate word that is an acceptable spelling match. Use the suffix class indication to select a representation of at least one acceptable suffix for the candidate words. Compare a representation of the at least one selected acceptable suffix with the determined ending portions of the individual candidate words that are acceptable spelling matches, to determine at least one predetermined acceptable relation between them.
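
The steps in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `SUFFIX_CLASSES` table, the rule that a spelling match is "acceptable" when the edit distance is at most one, the minimum stem length of three, and the function names `split_stem`, `edit_distance`, `candidate_endings`, and `find_matches`. The sketch also takes the suffix class as an argument rather than deriving it from the query word.

```python
# Hypothetical suffix classes (not from the patent): each class lists the
# endings treated as acceptable inflections for words of that class.
SUFFIX_CLASSES = {
    "verb": {"", "s", "ed", "ing"},
    "noun": {"", "s", "es"},
}


def split_stem(word, suffix_class):
    """Return the stem of `word` by stripping the longest suffix of the class."""
    for suffix in sorted(SUFFIX_CLASSES[suffix_class], key=len, reverse=True):
        # Assumed rule: keep at least three characters in the stem.
        if suffix and word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def edit_distance(a, b):
    """Levenshtein distance, used here as a stand-in spelling test."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def candidate_endings(stem, candidate, max_errors=1):
    """Yield the ending portion for every candidate prefix that is an
    acceptable spelling match for `stem`."""
    shortest = max(0, len(stem) - max_errors)
    longest = min(len(candidate), len(stem) + max_errors)
    for n in range(shortest, longest + 1):
        if edit_distance(stem, candidate[:n]) <= max_errors:
            yield candidate[n:]


def find_matches(query, candidates, suffix_class):
    """Return candidates that are both an acceptable spelling and an
    acceptable inflection of `query` under the assumptions above."""
    stem = split_stem(query, suffix_class)
    allowed = SUFFIX_CLASSES[suffix_class]
    return [
        word for word in candidates
        if any(ending in allowed for ending in candidate_endings(stem, word))
    ]


if __name__ == "__main__":
    print(find_matches("walking", ["walked", "wolks", "walrus", "walker"], "verb"))
```

With these assumed tables, the query "walking" reduces to the stem "walk"; "walked" and "wolks" pass both tests, while "walrus" and "walker" match the stem closely enough but have endings outside the assumed verb class.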
