Holistic Schema Matching for Web Query Interfaces


Abstract

Web databases, which dynamically return information in response to user queries, make up a significant part of today's Web. To help users submit queries to different Web databases, the query interface matching problem needs to be addressed. To solve this problem, we propose a new complex schema matching approach, Holistic Schema Matching (HSM). By examining the query interfaces of real Web databases, we observe that attribute matchings can be discovered from attribute-occurrence patterns. For example, in the Books domain, First Name often appears together with Last Name, whereas it rarely co-occurs with Author. Based on this observation, we design a count-based greedy algorithm to identify which attributes are more likely to be matched in the query interfaces. In particular, HSM can identify both simple matchings, i.e., 1:1 matchings, and complex matchings, i.e., 1:n or m:n matchings, between attributes. Our experiments show that HSM can discover both simple and complex matchings accurately and efficiently on real data sets.
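The co-occurrence idea in the abstract can be illustrated with a small sketch. The abstract does not give HSM's scoring formula or the details of its greedy procedure, so everything below is an assumption made for illustration: the function names (`cooccurrence_counts`, `match_score`, `greedy_simple_matches`), the score itself (high when two attributes are each frequent but rarely appear in the same interface), and the toy schemas. The sketch handles only 1:1 matchings and does not attempt the grouping needed for 1:n or m:n matchings.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(schemas):
    """Count how often each attribute, and each attribute pair, occurs."""
    occur, co = Counter(), Counter()
    for schema in schemas:
        attrs = sorted(set(schema))
        occur.update(attrs)
        co.update(combinations(attrs, 2))  # pairs are in sorted order
    return occur, co


def match_score(a, b, occur, co):
    """Illustrative score (an assumption, not taken from the abstract):
    large when a and b are both frequent but seldom co-present."""
    c_ab = co[tuple(sorted((a, b)))]
    return (occur[a] - c_ab) * (occur[b] - c_ab) / (occur[a] + occur[b])


def greedy_simple_matches(schemas):
    """Greedily pick the best-scoring pairs, using each attribute once."""
    occur, co = cooccurrence_counts(schemas)
    scored = [
        (match_score(a, b, occur, co), a, b)
        for a, b in combinations(sorted(occur), 2)
    ]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda t: (-t[0], t[1], t[2]))
    used, matches = set(), []
    for score, a, b in scored:
        if score <= 0:
            break  # remaining pairs always co-occur, so they are not synonyms
        if a in used or b in used:
            continue
        matches.append((a, b))
        used.update((a, b))
    return matches


# Hypothetical query-interface schemas from a Books-like domain.
schemas = [
    {"First Name", "Last Name", "Title"},
    {"First Name", "Last Name", "Title"},
    {"Author", "Title"},
    {"Author", "Title"},
    {"Author", "Title"},
]
matches = greedy_simple_matches(schemas)
print(matches)
```

On this toy input, First Name and Last Name always co-occur and therefore score zero against each other, while Author never shares an interface with either name attribute and is proposed as a match for one of them. A fuller treatment would first group First Name with Last Name and then match the group to Author, which is the kind of 1:n matching the abstract describes.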
