首页> 外国专利> Feature-Based Deduplication of Metadata for Places

Feature-Based Deduplication of Metadata for Places

机译:基于功能的代码数据数据删除

摘要

The technology disclosed relates to deduplicating metadata about places. A feature generator module is configured to generate features for metadata profiles. The metadata profiles represent a plurality of places. The features are based on geohash strings and word embeddings generated for the metadata profiles. A diff generator module is configured to generate diff vectors that pair-wise encode results of comparison between features of paired metadata profiles. A classification module is configured to generate similarity scores for the paired metadata profiles based on the diff vectors. A particular similarity score indicates whether metadata profiles in a particular pair of metadata profiles represent a same place.
机译:所公开的技术涉及重复数据删除的元数据。特征生成器模块被配置为为元数据配置文件生成功能。元数据配置文件代表多个位置。该功能基于为元数据配置文件生成的Geohash字符串和Word Embedings。 Diff生成器模块被配置为生成配对编码对成对元数据配置文件特征之间的比较结果的Dive向量。分类模块被配置为基于Diff向量生成用于成对元数据配置文件的相似性分数。特定的相似度分数指示特定对元数据配置文件中的元数据配置文件是否表示相同的位置。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号