首页> 外文期刊>The Computer journal >Automating Data Mart Construction from Semi-structured Data Sources
【24h】

Automating Data Mart Construction from Semi-structured Data Sources

机译:从半结构化数据源自动化数据集市构建

获取原文
获取原文并翻译 | 示例
           

摘要

The global food and agricultural industry has a total market value of USD 8 trillion in 2016, and decision makers in the Agri sector require appropriate tools and up-to-date information to make predictions across a range of products and areas. Traditionally, these requirements are met with information processed into a data warehouse and data marts constructed for analyses. Increasingly however, data are coming from outside the enterprise and often in unprocessed forms. As these sources are outside the control of companies, they are prone to change and new sources may appear. In these cases, the process of accommodating these sources can be costly and very time consuming. To automate this process, what is required is a sufficiently robust extract-transform-load process; external sources are mapped to some form of ontology, and an integration process to merge the specific data sources. In this paper, we present an approach to automating the integration of data sources in an Agri environment, where new sources are examined before an attempt to merge them with existing data marts. Our validation uses a case study of real world Agri data to demonstrate the robustness of our approach and the efficiency of materializing data marts.
机译:2016年全球食品和农业产业的总市值达到8万亿美元,农业领域的决策者需要适当的工具和最新信息,才能对各种产品和领域进行预测。传统上,将这些信息处理到数据仓库和用于分析的数据集市中即可满足这些要求。但是,越来越多的数据来自企业外部,并且通常是未经处理的形式。由于这些来源不受公司控制,因此它们很容易发生变化,因此可能会出现新的来源。在这些情况下,容纳这些源的过程可能是昂贵且非常耗时的。为了使该过程自动化,需要足够强大的提取-转换-加载过程。外部源被映射到某种形式的本体,以及用于合并特定数据源的集成过程。在本文中,我们提出了一种在Agri环境中自动集成数据源的方法,其中在尝试将新数据源与现有数据集市合并之前会对其进行检查。我们的验证通过对真实农业数据的案例研究来证明我们的方法的鲁棒性和实现数据集市的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号