首页> 外文期刊>Expert Systems with Application >Record linkage based on a three-way decision with the use of granular descriptors
【24h】

Record linkage based on a three-way decision with the use of granular descriptors

机译:使用粒度描述符基于三路决策记录链接

获取原文
获取原文并翻译 | 示例

摘要

Record linkage is a typical two-class recognition problem in data mining. To improve its classification performance of the problem, this paper proposes to apply three-way classification to identify uncertain points (regions) for further clerical investigation in decision-making. The detailed three-way decision process is realized by a two-phase approach. During the first phase, an information granule is constructed to describe the uncertain region in the data space. In the second phase, the constructed granule is utilized to discriminate between certain points (those with a high likelihood of belonging to one of the classes) and uncertain points (viz. those requiring clerical attention). For uncertain points, manual investigation is realized; for certain points, the generic binary classifier is applied for classification. Synthetic data and publicly available data are used to demonstrate the performance of the proposed approach. Finally, the proposed approach is shown effective in applications involving real-world record linkage data. (C) 2018 Elsevier Ltd. All rights reserved.
机译:记录链接是数据挖掘中典型的两类识别问题。为了提高对问题的分类性能,本文建议应用三向分类法来识别不确定点(区域),以便在决策中进行进一步的文书调查。详细的三路决策过程通过两阶段方法实现。在第一阶段,将构造一个信息粒度来描述数据空间中的不确定区域。在第二阶段中,使用构造的颗粒区分某些点(那些极有可能属于其中一类)和不确定点(即需要文员注意的点)。对于不确定的点,可以进行手动调查。对于某些点,将通用二进制分类器应用于分类。综合数据和公开可用数据用于证明所提出方法的性能。最后,所提出的方法在涉及实际记录链接数据的应用中显示出了有效的效果。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号