首页> 外国专利> TEXT SIMILARITY CALCULATION METHOD AND DEVICE, COMPUTER APPARATUS, AND STORAGE MEDIUM

TEXT SIMILARITY CALCULATION METHOD AND DEVICE, COMPUTER APPARATUS, AND STORAGE MEDIUM

机译:文本相似度计算方法和装置,计算机设备和存储介质

摘要

A text similarity calculation method comprises: obtaining a character sequence to be matched and a target character sequence; respectively preprocessing the character sequence to be matched and the target character sequence to obtain a corresponding word sequence to be matched and a corresponding target word sequence; performing calculation, by means of a first similarity algorithm, on a word to be matched contained in the word sequence to be matched and a target word contained in the target word sequence, so as to obtain a first similarity degree; extracting all of the words to be matched, so as to form a set of words to be matched, and extracting all of the target words to form a target word set; performing calculation on the set of words to be matched and the target word set by means of a second similarity algorithm, so as to obtain a second similarity degree; and calculating, according to the first similarity degree and the second similarity degree, a text similarity degree between the character sequence to be matched and the target character sequence.
机译:一种文本相似度计算方法,包括:获得待匹配的字符序列和目标字符序列;分别对待匹配的字符序列和目标字符序列进行预处理,得到对应的待匹配词序列和目标对象词序列;通过第一相似度算法,对待匹配词序列中包含的待匹配词与目标词序列中包含的目标词进行计算,得到第一相似度;提取所有待匹配词,以形成一组待匹配词,并提取所有目标词,形成目标词集;通过第二相似度算法对待匹配词集和目标词集进行计算,得到第二相似度;根据所述第一相似度和所述第二相似度,计算所述待匹配字符序列与所述目标字符序列之间的文本相似度。

著录项

  • 公开/公告号WO2019136993A1

    专利类型

  • 公开/公告日2019-07-18

    原文格式PDF

  • 申请/专利号WO2018CN99994

  • 发明设计人 AI MING;

    申请日2018-08-10

  • 分类号G06F17/27;

  • 国家 WO

  • 入库时间 2022-08-21 11:53:54

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号