APPARATUS OF SENTENCE SIMILARITY CALCULATION USING MORPHEME TRANSFORM TYPE AND METHOD THEREOF

Abstract

The present invention relates to a device and a method for calculating the similarity between sentences whose word order or sentence structure has been changed. The method, performed by the sentence similarity calculation device, comprises: extracting a plurality of morphemes from a sentence to be compared; generating a hash value for that sentence from the character code value and the position code value of each morpheme; determining the morpheme transformation type of the sentence by comparing its hash value with the hash value of a stored original sentence; and calculating the similarity between the sentence and the original sentence by using the morpheme transformation type. The morpheme transformation type includes a first type, in which the content of a morpheme is changed; a second type, in which the arrangement of morphemes is changed; and a third type, in which a morpheme is omitted. Because the similarity is calculated according to the morpheme transformation type, and takes into account the distance between transformed morphemes or the distance between the center of the sentence and a transformed morpheme, the invention determines similarity more accurately than simple pattern comparison.
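
The four steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not disclose the actual hash function, matching rule, or similarity formula, so everything below is an assumption made for illustration: whitespace tokens stand in for morphemes, the sentence hash is modeled as a list of (character code, position code) pairs, and the names `extract_morphemes`, `char_code`, `sentence_hash`, `transform_types`, `similarity`, and the `TYPE_PENALTY` weights are hypothetical. The center-distance weighting is one possible reading of "a distance between the center of a sentence and a transformed morpheme", not the patented formula.

```python
# Illustrative sketch only; none of these names or formulas come from the patent.

# Hypothetical per-type penalties (assumption: a content change costs the most).
TYPE_PENALTY = {"content": 1.0, "arrangement": 0.5, "omission": 0.75}


def extract_morphemes(sentence: str) -> list[str]:
    # Assumption: whitespace tokens stand in for morphemes. A real system
    # would use a morphological analyzer.
    return sentence.lower().split()


def char_code(morpheme: str) -> int:
    # Assumption: a simple polynomial rolling hash over Unicode code points.
    h = 0
    for ch in morpheme:
        h = (h * 31 + ord(ch)) % 1_000_003
    return h


def sentence_hash(sentence: str) -> list[tuple[int, int]]:
    # One (character code, position code) pair per morpheme.
    return [(char_code(m), pos) for pos, m in enumerate(extract_morphemes(sentence))]


def transform_types(original: str, candidate: str) -> dict[str, list[int]]:
    """Classify original-sentence positions by transformation type."""
    free: dict[int, list[int]] = {}  # character code -> unused candidate positions
    for code, pos in sentence_hash(candidate):
        free.setdefault(code, []).append(pos)

    matched = []    # (original position, candidate position) pairs
    unmatched = []  # original positions with no counterpart in the candidate
    for code, pos in sentence_hash(original):
        if free.get(code):
            matched.append((pos, free[code].pop(0)))
        else:
            unmatched.append(pos)

    # A matched morpheme counts as rearranged when its rank among the matched
    # morphemes differs between the two sentences. Ranks are used instead of
    # absolute positions so that an omission does not shift everything after it.
    cand_order = sorted(c for _, c in matched)
    rearranged = [o for i, (o, c) in enumerate(matched) if cand_order[i] != c]

    # Candidate morphemes left over are treated as replacements for unmatched
    # originals (type 1); any remaining unmatched originals are omissions (type 3).
    extra = sum(len(v) for v in free.values())
    return {
        "content": unmatched[:extra],
        "arrangement": rearranged,
        "omission": unmatched[extra:],
    }


def similarity(original: str, candidate: str) -> float:
    n = len(extract_morphemes(original))
    if n == 0:
        return 1.0 if not extract_morphemes(candidate) else 0.0
    center = (n - 1) / 2
    penalty = 0.0
    for kind, positions in transform_types(original, candidate).items():
        for pos in positions:
            # Hypothetical weighting: a change near the sentence center counts
            # more than one near either end. The factor stays within (0.5, 1].
            centrality = 1.0 - abs(pos - center) / n
            penalty += TYPE_PENALTY[kind] * centrality
    return max(0.0, 1.0 - penalty / n)
```

Calling `transform_types("the cat sat", "the dog sat")` reports a content change at position 1, while `similarity` returns 1.0 for identical sentences and a lower score as more morphemes are changed, moved, or dropped. Two limitations of this sketch: moving a single morpheme from one end of the sentence to the other shifts every rank and so marks all matched morphemes as rearranged, and morphemes that appear only in the compared sentence beyond those paired as replacements are ignored.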

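Bibliographic data

  • Publication number: KR101663453B1
  • Publication date: 2016-10-07
  • Original format: PDF
  • Applicant/patentee: BEYONDTECH INC.
  • Application number: KR20160098919
  • Inventors: LEE JAE CHENG; LEE SANG WOO; LEE SUNG GEUN
  • Filing date: 2016-08-03
  • Classification: G06F17/27
  • Country: KR
  • Date indexed: 2022-08-21 14:11:51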
