Representation of conceptual ETL designs in natural language using Semantic Web technology

Abstract

Extract-Transform-Load (ETL) processes constitute the back stage of Data Warehouse architectures. Several studies characterize the ETL design as a time-consuming and error-prone procedure. A critical phase in the ETL lifecycle involves the early communications and design steps that aim at producing a conceptual ETL design. Various research approaches have dealt with the conceptual modeling of ETL processes, but all share two inconveniences: they require intensive human effort from the designers to create them, as well as technical knowledge from the business people to understand them. In this paper, we focus on the second aspect and provide a method for the representation of a conceptual ETL design as a narrative, which is the most natural means of communication and does not require particular technical skills or familiarity with any specific model. Specifically, this work builds upon previously proposed techniques that automate the conceptual design by leveraging Semantic Web technology. The key idea is to map the involved data stores, either source or target, to a domain ontology and then, to use a reasoner for producing the ETL design. We discuss how linguistic techniques can be used for the establishment of a common application vocabulary. We present a flexible and customizable template-based mechanism for the representation of the ETL design as a narrative. Finally, we discuss issues related to the production of meaningful reports and we provide implementation details.

Bibliographic details

  • Source
    Data & Knowledge Engineering | 2010, Issue 1 | pp. 96-115 | 20 pages
  • Author affiliations

    Hewlett-Packard Laboratories, Intelligent Information Management Lab (IIML), 1501 Page Mill Road, B1U/C15, M/S 1142, Palo Alto, CA 94304-1126, USA;

    National Technical University of Athens, Athens, Greece;

    Hewlett-Packard Laboratories, Intelligent Information Management Lab (IIML), 1501 Page Mill Road, B1U/C15, M/S 1142, Palo Alto, CA 94304-1126, USA;

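  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA;
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification (CLC)
  • Keywords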

    ETL; data warehouses; conceptual model; natural language; ontologies; semantic web; metadata;

  • Date added to database: 2022-08-18 02:18:03
