...
首页> 外文期刊>SIGMOD record >A Holistic Paradigm for Large Scale Schema Matching
【24h】

A Holistic Paradigm for Large Scale Schema Matching

机译:大规模模式匹配的整体范式

获取原文
获取原文并翻译 | 示例
           

摘要

Schema matching is a critical problem for integrating heterogeneous information sources. Traditionally, the problem of matching multiple schemas has essentially relied on finding pairwise-attribute correspondences in isolation. In contrast, we propose a new matching paradigm, holistic schema matching, to match many schemas at the same time and find all matchings at once. By handling a set of schemas together, we can explore their context information that reflects the semantic correspondences among attributes. Such information is not available when schemas are matched only in pairs. As the realizations of holistic schema matching, we develop two alternative approaches: global evaluation and local evaluation. Global evaluation exhaustively assesses all possible "models," where a model expresses all attribute matchings. In particular, we propose the MGS framework for such global evaluation, building upon the hypothesis of the existence of a hidden schema model that probabilistically generates the schemas we observed. On the other hand, local evaluation independently assesses every single matching to incrementally construct such a model. In particular, we develop the DCM framework for local evaluation, building upon the observation that co-occurrence patterns across schemas often reveal the complex relationships of attributes. We apply our approaches to match query interfaces on the deep Web. The result shows the effectiveness of both the MGS and DCM approaches, which together demonstrate the promise of holistic schema matching.
机译:模式匹配是集成异构信息源的关键问题。传统上,匹配多个模式的问题基本上依赖于孤立地查找成对属性对应。相反,我们提出了一个新的匹配范式,即整体模式匹配,以同时匹配多个模式并立即查找所有匹配。通过一起处理一组模式,我们可以探索它们的上下文信息,以反映属性之间的语义对应。当模式仅成对匹配时,此类信息不可用。随着整体方案匹配的实现,我们开发了两种替代方法:全局评估和局部评估。全局评估详尽评估所有可能的“模型”,其中模型表示所有属性匹配。尤其是,我们基于存在隐式模式模型的假设而提出了用于此类全局评估的MGS框架,该模型以概率方式生成了我们观察到的模式。另一方面,本地评估会独立评估每个匹配项,以逐步构建此类模型。特别是,我们在观察到跨架构的共现模式通常揭示属性的复杂关系的基础上,开发了用于局部评估的DCM框架。我们应用我们的方法来匹配深度Web上的查询接口。结果显示了MGS和DCM方法的有效性,共同证明了整体方案匹配的前景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号