首页> 外文OA文献 >A Bag Reconstruction Method for Multiple Instance Classification and Group Record Linkage
【2h】

A Bag Reconstruction Method for Multiple Instance Classification and Group Record Linkage

机译:一种用于多实例分类和组记录链接的袋重构方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Record linking is the task of detecting records in several databases that refer to the same entity. This task aims at exploring the relationship between entities, which normally lack common identifiers in heterogeneous datasets. When entities contain multiple relational records, linking them across datasets can be more accurate by treating the records as groups, which leads to group linking methods. Even so, individual record links may still be needed for the final group linking step. This problem can be solved by multiple instance learning, in which group links are modelled as bags, and record links are considered as instances. In this paper, we propose a novel method for instance classification and group record linkage via bag reconstruction from instances. The bag reconstruction is based on the modeling of the distribution of negative instances in the training bags via kernel density estimation. We evaluate this approach on both synthetic and real-world data. Our results show that the proposed method can outperform several baseline methods.
机译:记录链接是在多个引用同一实体的数据库中检测记录的任务。此任务旨在探索实体之间的关系,这些实体通常在异构数据集中缺少通用标识符。当实体包含多个关系记录时,通过将记录视为组,可以更准确地跨数据集链接它们,这导致了组链接方法。即使这样,最后的组链接步骤仍可能需要单独的记录链接。该问题可以通过多实例学习来解决,其中将组链接建模为包,将记录链接视为实例。在本文中,我们提出了一种新的实例分类和组记录链接方法,该方法通过从实例中重构袋子进行。袋重构基于通过核密度估计对训练袋中否定实例的分布进行建模。我们在合成数据和真实数据上都评估了这种方法。我们的结果表明,所提出的方法可以优于几种基准方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号