首页> 外文会议>IEEE Conference on Business Informatics >ELM: An Extended Logic Matching Method on Record Linkage Analysis of Disparate Databases for Profiling Data Mining
【24h】

ELM: An Extended Logic Matching Method on Record Linkage Analysis of Disparate Databases for Profiling Data Mining

机译:ELM:用于分析数据挖掘的不同数据库的记录链接分析的扩展逻辑匹配方法

获取原文

摘要

As predictive marketing and customer profiling solutions have become more sophisticated, they have increasingly become dependent on data from external sources. In order to utilize this data, records must be linked to internal records without the use of unique identifiers. The Extendable Logic for Matching (ELM) performs probabilistic matching from disparate sources and classifies matches according to discrete values reflective of their utility. Sets of matching rules are evaluated based on their performance on supervised classification tasks. High performance on a classification task is indicative of congruity with the real-world entity concerned, giving a sense of matching quality without the use of a gold standard. A set of matching rules generated using name and address was compared to a set which was matched using exact string comparison. We conclude that exact string comparison is a superior method for matching on highly sparse demographic data from disparate sources.
机译:随着预测性营销和客户分析解决方案变得越来越复杂,它们越来越依赖于来自外部来源的数据。为了利用此数据,必须将记录链接到内部记录,而无需使用唯一标识符。匹配的可扩展逻辑(ELM)从不同的源执行概率匹配,并根据反映其效用的离散值对匹配进行分类。根据匹配规则集在监督分类任务中的性能来评估它们。分类任务的高性能表明它与相关的真实世界实体保持一致,从而在不使用黄金标准的情况下提供了匹配质量的感觉。将使用名称和地址生成的一组匹配规则与使用精确字符串比较进行匹配的一组进行比较。我们得出结论,精确的字符串比较是匹配来自不同来源的高度稀疏的人口统计数据的一种出色方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号