Data Warehouse Integration Using Best Fit Matching


Abstract

Much of the research related to information retrieval focuses on finding as many relevant pieces of data about a topic as possible. Applied to identity matching, this approach would find as many variants of a search name as it can. This works well for some applications, but others require a best fit match, in which a single name is expected to have zero or one matches in another data set. Many systems accomplish this with a hard key, such as a driver's license number or social security number. This paper presents SQL-based soft matching to obtain a best fit match. This is valuable when one or more of the data sets being compared has data quality problems or lacks complete and reliable hard keys.
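
The abstract does not describe the authors' SQL, so the following is only a minimal sketch of what a best fit soft match could look like, not the paper's method. It assumes two hypothetical tables (source_a, source_b) with made-up rows, a hypothetical similarity() function registered from Python's difflib, and an illustrative threshold. A pair is kept only when it is the highest-scoring candidate for both records, so each name ends up with zero or one matches.

```python
import sqlite3
from difflib import SequenceMatcher


def similarity(x, y):
    """Soft-match score in [0, 1]; case-insensitive character similarity."""
    return SequenceMatcher(None, x.lower(), y.lower()).ratio()


# In-memory database with two hypothetical data sets that share no hard key.
conn = sqlite3.connect(":memory:")
conn.create_function("similarity", 2, similarity)
conn.executescript("""
    CREATE TABLE source_a (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE source_b (id INTEGER PRIMARY KEY, name TEXT);
    INSERT INTO source_a VALUES
        (1, 'Jonathan Smith'), (2, 'Maria Garcia'), (3, 'Robert Brown');
    INSERT INTO source_b VALUES
        (10, 'Jonathon Smith'), (11, 'Jon Smith'),
        (12, 'Maria Garcia'), (13, 'Alice Wong');
""")

BEST_FIT_SQL = """
WITH candidates AS (
    -- Every cross-table pair whose soft score clears the threshold.
    SELECT a_id, b_id, score FROM (
        SELECT a.id AS a_id, b.id AS b_id,
               similarity(a.name, b.name) AS score
        FROM source_a AS a CROSS JOIN source_b AS b
    ) WHERE score >= ?
)
SELECT c.a_id, c.b_id, c.score
FROM candidates AS c
-- Keep a pair only if no other candidate beats it for the same A record ...
WHERE NOT EXISTS (
    SELECT 1 FROM candidates AS o
    WHERE o.a_id = c.a_id
      AND (o.score > c.score OR (o.score = c.score AND o.b_id < c.b_id))
)
-- ... and none beats it for the same B record (ties broken by lower id).
AND NOT EXISTS (
    SELECT 1 FROM candidates AS o
    WHERE o.b_id = c.b_id
      AND (o.score > c.score OR (o.score = c.score AND o.a_id < c.a_id))
)
ORDER BY c.a_id
"""


def best_fit_matches(connection, threshold):
    """Return (a_id, b_id, score) rows; each id appears at most once."""
    return connection.execute(BEST_FIT_SQL, (threshold,)).fetchall()


for a_id, b_id, score in best_fit_matches(conn, 0.7):
    print(a_id, b_id, round(score, 3))
```

With these made-up rows, 'Jonathan Smith' has two candidates above the threshold ('Jonathon Smith' and 'Jon Smith') but is paired only with the closer one, while 'Robert Brown' and 'Alice Wong' remain unmatched. The cross join compares every pair, which is fine for a sketch; a real warehouse load would likely need a blocking condition to limit the candidate set.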


Bibliographic details

Source: International Conference on Information and Knowledge Engineering, 2003, 5 pages
Authors: David Holmes; Donald Maxwell
Original format: PDF
Chinese Library Classification: Information and communication theory

