首页> 外国专利> AUTOMATIC, COMPUTER-BASED SIMILARITY CALCULATION SYSTEM FOR QUANTIFYING THE SIMILARITY OF TEXT EXPRESSIONS

AUTOMATIC, COMPUTER-BASED SIMILARITY CALCULATION SYSTEM FOR QUANTIFYING THE SIMILARITY OF TEXT EXPRESSIONS

机译:基于计算机的自动相似度计算系统,用于量化文本表达的相似度

摘要

The invention relates to a device and a method for the automatic, computer-based weighting of the similarity of text expressions. The inventive system or method comprises a document database unit (1), a candidate expression storage unit (2), and a similarity weight value calculation unit (3) while being characterized in that the similarity weight values agw(t1, t2) for the individual pairs of expressions can be calculated based on a degree of similarity occ_con(t1, t2) that takes into account both the total frequency with which the two expressions of a pair of expressions are used within one and the same text segment in a number of several text segments and the total number of different context expressions in said number of text segments.
机译:本发明涉及一种用于对文本表达的相似性进行自动的,基于计算机的加权的设备和方法。本发明的系统或方法包括文档数据库单元(1),候选表达存储单元(2)和相似性权重值计算单元(3),同时其特征在于,针对该相似性权重值agw(t1,t2)。可以基于相似度occ_con(t1,t2)计算单个表达式对,该相似度考虑了一对表达式中两个表达式在一个文本段和多个文本段中使用的总频率。多个文本段,以及所述多个文本段中不同上下文表达的总数。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号