Morphological/phonetic method for ranking word similarities

Abstract

A computer method is disclosed for ranking word similarities which is applicable to a variety of dictionary applications such as synonym generation, linguistic analysis, and document characterization. The method is based upon transforming an input word string into a key word that is invariant for certain types of errors in the input word, such as the doubling of letters, consonant/vowel transpositions, and consonant/consonant transpositions. The specific mapping technique is a morphological mapping that generates keys whose similarities can be detected during a subsequent ranking procedure. The mapping is defined such that the unique consonants of the input word are listed in their original order, followed by the unique vowels of the input word, also in their original order. The keys thus generated are invariant for consonant/vowel transpositions and doubled letters. The utility of the keys is further improved by arranging the consonants in each key in alphabetical order, followed by the vowels in alphabetical order. The resultant mapping is insensitive to consonant/consonant transpositions as well as to consonant/vowel transpositions and doubled letters. The method then applies a ranking technique that uses a compound measure of similarity to rank the key words.
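The two key mappings described in the abstract can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the patent's implementation: the function names `ordered_key` and `sorted_key` are hypothetical, the vowel set is assumed to be `a, e, i, o, u` (so `y` is treated as a consonant), and input is lowercased with non-letters dropped. The ranking step is omitted because the abstract does not define the compound similarity measure.

```python
# Assumption: the vowel set is a/e/i/o/u; the abstract does not specify it.
VOWELS = frozenset("aeiou")


def ordered_key(word: str) -> str:
    """Unique consonants in original order, then unique vowels in original order."""
    seen = set()
    consonants, vowels = [], []
    for ch in word.lower():
        if not ch.isalpha() or ch in seen:
            continue  # skip non-letters and repeated letters
        seen.add(ch)
        (vowels if ch in VOWELS else consonants).append(ch)
    return "".join(consonants) + "".join(vowels)


def sorted_key(word: str) -> str:
    """Same as ordered_key, but consonants and vowels are each alphabetized."""
    key = ordered_key(word)
    consonants = sorted(ch for ch in key if ch not in VOWELS)
    vowels = sorted(ch for ch in key if ch in VOWELS)
    return "".join(consonants) + "".join(vowels)


if __name__ == "__main__":
    # Doubled letter: both spellings map to the same key.
    print(ordered_key("letter"), ordered_key("leter"))
    # Consonant/vowel transposition: "form" vs "from".
    print(ordered_key("form"), ordered_key("from"))
    # Consonant/consonant transposition: only the sorted key matches.
    print(ordered_key("and"), ordered_key("adn"))
    print(sorted_key("and"), sorted_key("adn"))
```

In this sketch, swapping an adjacent consonant and vowel leaves the relative order of the consonants and of the vowels unchanged, which is why the first mapping already absorbs that error. A consonant/consonant swap changes the consonant order, so only the alphabetized key absorbs it.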