A Unified Approach for Schema Matching, Coreference and Canonicalization

机译：模式匹配，共指和规范化的统一方法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The automatic consolidation of database records from many heterogeneous sources into a single repository requires solving several information integration tasks. Although tasks such as coreference, schema matching, and canonicalization are closely related, they are most commonly studied in isolation. Systems that do tackle multiple integration problems traditionally solve each independently, allowing errors to propagate from one task to another. In this paper, we describe a discriminatively-trained model that reasons about schema matching, coreference, and canonicalization jointly. We evaluate our model on a real-world data set of people and demonstrate that simultaneously solving these tasks reduces errors over a cascaded or isolated approach. Our experiments show that a joint model is able to improve substantially over systems that either solve each task in isolation or with the conventional cascade. We demonstrate nearly a 50% error reduction for coreference and a 40% error reduction for schema matching.

机译：将来自许多异构源的数据库记录自动合并到单个存储库中需要解决多个信息集成任务。尽管诸如共同引用，模式匹配和规范化之类的任务紧密相关，但最常单独研究它们。传统上，解决多个集成问题的系统会独立解决每个问题，从而使错误从一项任务传播到另一项任务。在本文中，我们描述了一个经过区别训练的模型，该模型共同说明了模式匹配，共引用和规范化的原因。我们在真实的人员数据集上评估了我们的模型，并证明了同时解决这些任务可以减少级联或孤立方法的错误。我们的实验表明，联合模型能够大大改善系统的性能，该系统可以单独解决问题，也可以使用常规级联解决每个任务。我们证明了共引用的错误减少了近50％，模式匹配的错误减少了40％。

著录项

来源
《ACMKDD International Conference on Knowledge Discovery and Data Mining;KDD 2008》|2008年|704-712|共9页
会议地点
作者
Michael Wick; Khashayar Rohanimanesh; Karl Schultz; Andrew McCallum;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息与知识传播;
关键词
data integration; coreference; schema matching; canoni-calization; conditional random field; weighted logic;

机译：数据整合;共指模式匹配;焦糖化条件随机场加权逻辑;

相似文献

外文文献
中文文献
专利

1. A Semantic Resource Based Approach for Star Schemas Matching [J] . Elhaj Elamin, Amer Alzaidi, Jamel Feki International Journal of Database Management Systems . 2018,第6期

机译：基于语义资源的星图匹配方法
2. Linear canonical transformations and quantum phase: a unified canonical and algebraic approach [J] . T Hakioglu Journal of Physics, A. Mathematical and General: A Europhysics Journal . 1999,第22期

机译：线性规范变换和量子相：统一的规范和代数方法
3. A schema matching approach for integrated mobility service [J] . Hirashima Yoko, Komoda Norihisa, Fujiwara Toru Electronics and communications in Japan . 2019,第9期

机译：集成移动服务的模式匹配方法
4. A unified approach for schema matching, coreference and canonicalization [C] . Michael L. Wick, Khashayar Rohanimanesh, Karl Schultz, ACM SIGKDD international conference on Knowledge discovery and data mining . 2008

机译：模式匹配，共指和规范化的统一方法
5. A unified approach to primary key generation in banking risk reporting system using a new database schema [D] . Khandelwal, Ashish 2008

机译：使用新的数据库架构在银行风险报告系统中生成主密钥的统一方法
6. Adverse Childhood Experiences and Early Maladaptive Schemas as Predictors of Cyber Dating Abuse: An Actor-Partner Interdependence Mediation Model Approach [O] . Laura Celsi, F. Giorgia Paleari, Frank D. Fincham 2021

机译：不利的童年经验和早期的不适性模式作为网络约会滥用的预测因素：演员合作伙伴相互依存调解模型方法
7. A Unified Approach for Schema Matching, Coreference and Canonicalization [O] . Wick, Michael, Rohanimanesh, Khashayar, Schultz, Karl, 2008

机译：图式匹配，共指和规范化的统一方法
8. Unified Approach to Dynamic Matching and Barter Exchange. [R] . Dickerson, J. P. 2016

机译：动态匹配和易货交换的统一方法。

A Unified Approach for Schema Matching, Coreference and Canonicalization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅