首页> 外文学位 >Source discovery and schema mapping for data integration.
【24h】

Source discovery and schema mapping for data integration.

机译:用于数据集成的源发现和模式映射。

获取原文
获取原文并翻译 | 示例

摘要

As data explodes on the Web, there is a need to integrate data from a large number of heterogeneous information sources. Currently, there are two main basic approaches to data integration: Global-as-View (GAV) and Local-as-View (LAV). However, both approaches have their limitations for large-scale applications. To resolve the problems, we offer a Target-based Integration Query System (TIQS) as an alternative point of view that is neither GAV nor LAV The approach uses a predefined conceptual target schema, which is specified ontologically and independently of any of the sources, as a central, organizing concept. In this dissertation, we focus on the resolutions to three problems in TIQS: (1) automatically recognizing information sources for the target, (2) automating source-to-target mappings between source and target schemas, and (3) query reformulation based on source-to-target mappings. Experiments we have conducted show that we have been able to achieve good performance for the recognition of applicable documents as well as the generation of source-to-target mappings. Moreover, we have proven that query reformulation in TIQS reduces to rule unfolding and the reformulated user queries extract all the query answers available from sources with respect to the definition of TIQS for the proposed queries.
机译:随着数据在Web上的爆炸性增长,需要集成来自大量异构信息源的数据。当前,数据集成有两种主要的基本方法:全局视域(GAV)和局部视域(LAV)。但是,这两种方法在大规模应用中都有其局限性。为了解决这些问题,我们提供了基于目标的集成查询系统(TIQS)作为GAV或LAV的替代观点。该方法使用预定义的概念目标架构,该架构是在本体上指定的,与任何来源无关,作为一个中心的,组织的概念。在本文中,我们着重解决TIQS中的三个问题:(1)自动识别目标的信息源;(2)自动实现源和目标模式之间的源到目标映射;以及(3)基于源到目标的映射。我们进行的实验表明,我们已经能够在识别适用文档以及生成源到目标映射方面取得良好的性能。此外,我们已经证明,TIQS中的查询重新构造减少了规则展开,并且重新构造的用户查询针对建议的查询的TIQS定义,从源中提取了所有可用的查询答案。

著录项

  • 作者

    Xu, Li.;

  • 作者单位

    Brigham Young University.;

  • 授予单位 Brigham Young University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 137 p.
  • 总页数 137
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号