首页> 外国专利> Automated computation of semantic similarity of pairs of named entity phrases using electronic document corpora as background knowledge

Automated computation of semantic similarity of pairs of named entity phrases using electronic document corpora as background knowledge

机译：使用电子文档语料库作为背景知识自动计算成对的命名实体短语对的语义相似度

页面导航

摘要
著录项
相似文献

摘要

An overall semantic similarity score value between pairs of named entities in a text corpus is obtained by calculating for at least one pair of named entities a plurality of corresponding pair similarity score values according to a first and at least a second classifier using electronic information sources. Each pair similarity score value of the pair of named entities per classifier is normalized by calculating a rank list per classifier, for example, for each named entity. The rank list holds each pair of named entities of the text corpus, wherein a rank of each pair of named entities within the rank list reflects the respective pair similarity score value. Further an arithmetic mean of the normalized pair similarity score value of each pair of named entities is calculated to provide the overall semantic similarity score value.

机译：通过使用电子信息源根据第一和至少第二分类器为至少一对命名实体计算多个对应的对相似性得分值来获得文本语料库中的成对命名实体之间的总体语义相似性得分值。通过例如针对每个命名实体计算每个分类器的等级列表来归一化每个分类器对的一对命名实体的每个对相似性得分值。等级列表保存文本语料库的每对命名实体，其中，等级列表内的每对命名实体的等级反映相应的对相似度得分值。此外，计算每对命名实体的归一化对相似度得分值的算术平均值，以提供整体语义相似度得分值。

著录项

公开/公告号US8170969B2

专利类型
公开/公告日2012-05-01

原文格式PDF
申请/专利权人 HANS ROETTGER;CAI-NICOLAS ZIEGLER;
展开▼

申请/专利号US20080246894
发明设计人 HANS ROETTGER;CAI-NICOLAS ZIEGLER;
展开▼

申请日2008-10-07
分类号G06F17;
国家 US
入库时间 2022-08-21 17:26:45

相似文献

专利
外文文献
中文文献