首页> 外文会议>International conference on management of data >Designing and Refining Schema Mappings via Data Examples
【24h】

Designing and Refining Schema Mappings via Data Examples

机译:通过数据示例设计和完善模式映射

获取原文

摘要

A schema mapping is a specification of the relationship between a source schema and a target schema. Schema mappings are fundamental building blocks in data integration and data exchange and, as such, obtaining the right schema mapping constitutes a major step towards the integration or exchange of data. Up to now, schema mappings have typically been specified manually or have been derived using mapping-design systems that automatically generate a schema mapping from a visual specification of the relationship between two schemas.We present a novel paradigm and develop a system for the interactive design of schema mappings via data examples. Each data example represents a partial specification of the semantics of the desired schema mapping. At the core of our system lies a sound and complete algorithm that, given a finite set of data examples, decides whether or not there exists a GLAV schema mapping (i.e., a schema mapping specified by Global-and-Local-As-View constraints) that "fits" these data examples. If such a fitting GLAV schema mapping exists, then our system constructs the "most general" one.We give a rigorous computational complexity analysis of the underlying decision problem concerning the existence of a fitting GLAV schema mapping, given a set of data examples. Specifically, we prove that this problem is complete for the second level of the polynomial hierarchy, hence, in a precise sense, harder than NP-complete. This worst-case complexity analysis notwithstanding, we conduct an experimental evaluation of our prototype implementation that demonstrates the feasibility of interactively designing schema mappings using data examples. In particular, our experiments show that our system achieves very good performance in real-life scenarios.
机译:模式映射是源模式和目标模式之间关系的规范。模式映射是数据集成和数据交换中的基本构建块,因此,获取正确的模式映射是迈向数据集成或交换的重要一步。到目前为止,模式映射通常是手动指定的,或者是使用映射设计系统派生的,该系统会根据两个模式之间的关系的可视化说明自动生成模式映射。我们提出了一种新颖的范例,并开发了用于交互式设计的系统通过数据示例进行模式映射。每个数据示例表示所需架构映射的语义的部分说明。系统的核心是一个完善的算法,给定一组有限的数据示例,该算法将确定是否存在GLAV模式映射(即,由“全局和局部视域约束”指定的模式映射) ),以“适合”这些数据示例。如果存在这样的拟合GLAV模式映射,则我们的系统将构建“最通用”的映射。我们给出了与存在GLAV模式映射有关的基本决策问题的严格计算复杂度分析,并给出了一组数据示例。具体来说,我们证明该问题对于多项式层次结构的第二级是完整的,因此,从精确的意义上讲,它比NP-complete困难。尽管如此,尽管进行了最坏情况的复杂性分析,我们仍对原型实现进行了实验评估,从而证明了使用数据示例交互式设计方案映射的可行性。特别是,我们的实验表明,我们的系统在现实生活中可以实现非常好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号