首页> 外国专利> SEMANTIC SIMILARITY ANALYSIS TO DETERMINE RELATEDNESS OF HETEROGENEOUS DATA

SEMANTIC SIMILARITY ANALYSIS TO DETERMINE RELATEDNESS OF HETEROGENEOUS DATA

机译:确定异构数据相关性的语义相似度分析

摘要

A method and system to determine relatedness select a first customer observable from a first source document, the first customer observable being made up of two terms, the two terms being a first term of a first type and a first term of a second type, and select a second customer observable from a second source document, the second customer observable being made up of a second term of the first type and a second term of the second type. The method includes creating a first corpus of all documents that include the first terms, creating a second corpus of all documents that include the second terms, obtaining other first terms in the first corpus and other second in the second corpus, and performing semantic similarity analysis to determine a similarity score between the first customer observable and the second customer observable.
机译:一种用于确定相关性的方法和系统,其从第一源文档中选择可观察的第一顾客,该第一顾客可观察物由两个项组成,这两个项是第一类型的第一项和第二类型的第一项,以及从第二个源文档中选择一个可观察的第二个客户,该第二个可观察的客户由第一类型的第二项和第二类型的第二项组成。该方法包括:创建包括第一术语的所有文档的第一语料库;创建包括第二术语的所有文档的第二语料库;在第一语料库中获得其他第一术语,在第二语料库中获得其他第二术语;以及执行语义相似性分析确定第一可观察客户与第二可观察客户之间的相似度分数。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号