首页> 外文学位 >Interactive and modular design of schema mappings.
【24h】

Interactive and modular design of schema mappings.

机译:模式映射的交互式和模块化设计。

获取原文
获取原文并翻译 | 示例

摘要

A primordial task in information integration is to specify the relationships, called schema mappings, between database schemas. One of the fundamental applications of schema mappings is to specify how data structured under a source schema is to be transformed into data structured under a target schema. Since schemas that occur in real life are typically large and heterogeneous, designing schema mappings is an error-prone, laborious, and time consuming process.;This dissertation studies a novel "divide-and-merge" paradigm for schema mapping creation. Our framework allows a design task to be divided into smaller components that are easier to create and understand. Each of the component schema mappings can be designed independently, through an interactive process driven by data examples. To complete the design process, the novel MapMerge schema mapping operator can be used to automatically generate a meaningful overall mapping by correlating the specifications given by the individual mapping components. Specifically, my thesis explores how to facilitate the process of designing each schema mapping through data examples and how to assemble the independent schema mappings into a global mapping.;To design a schema mapping, the user can provide a set of data examples, each representing a partial specification of the semantics of the desired schema mapping. Based on such a set of data examples, the proposed techniques construct a schema mapping specified by Global-and-Local-As-View constraints that "fits" the data examples, if such mapping exists. Furthermore, system generated data examples can be used to guide the user through a schema mapping refinement process, focusing on specific components of a mapping specification, such as the design of grouping semantics, or the choice of the desired interpretation in the case of ambiguous mappings.;The flows of independently designed schema mappings can then be automatically orchestrated into larger, semantically richer schema mappings through the novel MapMerge schema mapping operator. The key idea behind MapMerge is the reuse of mapping behavior from more general mappings to more specific mappings. MapMerge allows for the modular construction of complex mappings from various types of smaller mappings, such as schema correspondences produced by a schema matcher or pre-existing mappings that were designed by a human user (for instance, through the proposed techniques based on data examples), or via more traditional mapping tools. It was shown experimentally that MapMerge improves the quality of the schema mappings in terms of preserving data associations from the input source instance to the generated target instance.;Finally, a novel benchmark was used to assess the relative merits of existing mapping-design systems. The findings of this benchmark on a set of commercial and research systems confirmed the high costs of traditional schema mapping design in terms of time and effort, and thus provided further motivation for the alternative schema mapping design methodology proposed in this dissertation.
机译:信息集成的首要任务是指定数据库架构之间的关系,称为架构映射。模式映射的基本应用之一是指定如何将在源模式下构造的数据转换为在目标模式下构造的数据。由于现实生活中出现的模式通常庞大且异构,因此设计模式映射是一个容易出错,费力且耗时的过程。本论文研究了一种用于模式映射创建的新颖的“划分并合并”范式。我们的框架允许将设计任务划分为更易于创建和理解的较小组件。每个组件架构映射可以通过数据示例驱动的交互式过程独立设计。为了完成设计过程,可以使用新颖的MapMerge模式映射运算符通过关联各个映射组件给出的规范来自动生成有意义的总体映射。具体来说,本文探讨了如何通过数据示例来促进设计每个模式映射的过程以及如何将独立的模式映射组装成全局映射。为了设计模式映射,用户可以提供一组数据示例,每个数据示例代表所需模式映射的语义的部分规范。基于这样的一组数据示例,如果存在这样的映射,则所提出的技术将构造由“适合”数据示例的“全局和局部视域”约束所指定的模式映射。此外,系统生成的数据示例可用于指导用户完成模式映射优化过程,重点关注映射规范的特定组件,例如分组语义的设计,或在模棱两可的映射情况下选择所需的解释然后,可以通过新颖的MapMerge模式映射运算符将独立设计的模式映射的流程自动编排为更大,语义更丰富的模式映射。 MapMerge背后的关键思想是将映射行为从更一般的映射重用到更具体的映射。 MapMerge允许从各种类型的较小映射(例如,由模式匹配器生成的模式对应关系或由人类用户设计的预先存在的映射,例如通过基于数据示例的建议技术)构建复杂映射的模块化构建,或通过更传统的地图绘制工具。实验表明,MapMerge在保留从输入源实例到生成的目标实例的数据关联方面提高了模式映射的质量。最后,使用一种新颖的基准来评估现有映射设计系统的相对优点。该基准在一组商业和研究系统上的发现证实了传统模式映射设计在时间和精力上的高昂成本,从而为本文提出的替代模式映射设计方法论提供了进一步的动力。

著录项

  • 作者

    Alexe, Bogdan.;

  • 作者单位

    University of California, Santa Cruz.;

  • 授予单位 University of California, Santa Cruz.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 217 p.
  • 总页数 217
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号