首页> 外国专利> Similarity-based clustering search engine

Similarity-based clustering search engine

机译：基于相似度的聚类搜索引擎

页面导航

摘要
著录项
相似文献

摘要

A search engine identifies external data records that describe similar entities and may each conform to a different data format or source schema. The engine derives mappings capable of translating data values between differently formatted attributes of two source schemas and uses these mappings to identify degrees of similarity between attributes and schemas. When the search engine receives a search request, the engine translates submitted search criteria into values of a first schema's attributes and then uses the mappings to map those values onto selected attributes of other schemas. The search engine then uses each schema's selected attributes to select external data records formatted in that schema. Each selected record is assigned a match score that is weighted by the similarity of the record schema's selected attributes to the search criteria. Records are then retrieved in order of decreasing match score.

机译：搜索引擎标识描述相似实体的外部数据记录，并且每个外部数据记录可能符合不同的数据格式或源架构。该引擎派生出能够在两个源模式的不同格式属性之间转换数据值的映射，并使用这些映射来识别属性和模式之间的相似度。当搜索引擎收到搜索请求时，引擎会将提交的搜索条件转换为第一模式属性的值，然后使用映射将这些值映射到其他模式的选定属性上。然后，搜索引擎使用每个模式的选定属性来选择以该模式格式化的外部数据记录。为每个选定的记录分配一个匹配分数，该分数由记录架构的选定属性与搜索条件的相似性加权。然后按比赛得分递减的顺序检索记录。

著录项

公开/公告号US10691652B2

专利类型
公开/公告日2020-06-23

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201815939393
发明设计人 WEI YAN;JIE WANG;JIAN HUI CHEN;YU ZHU XU;XIAO BO REN;JIAN LU;
展开▼

申请日2018-03-29
分类号G06F17;G06F16/21;G06F16/28;G06F16/951;G06F16/2457;
国家 US
入库时间 2022-08-21 11:31:09

相似文献

专利
外文文献
中文文献