首页> 外文学位 >Design and implementation of an institutional data warehouse from a decision support perspective.
【24h】

Design and implementation of an institutional data warehouse from a decision support perspective.

机译:从决策支持的角度设计和实施机构数据仓库。

获取原文
获取原文并翻译 | 示例

摘要

The following project has consisted in the design and implementation of a methodology for the realization of a data warehouse targeted for a university business environment. The principle objective of such an endeavor has been to empower decision makers via the use of dashboard technology. More specifically this project, based on both a theoretical and a particularly strong practical background, has been specifically within the context of better managing university student clientele.;The early phase of our project consisted of evaluating various data warehousing design methodologies. In fact, the methodology we have developed is based on Ralph Kimbal's methodology which is based on the principle of realizing dimensional models which are end-to-end business process and subject-oriented. Hence, the design methodology we shall propose herein aims at encouraging scalability as the project continues to evolve and new heterogeneous data sources become available to end-users. Subsequently, a detailed analysis of existing commercial and open-source data warehouse and business intelligence products was carried out with the aim of eventually recommending a suitable solution at UQTR for meeting the business requirements. More specifically, two types of tools were deemed necessary for the project: a back-end extraction, transformation and loading tool (ETC) and a front-end business intelligence framework for presenting data and information to end-users via the Web. Finally, we have proposed a framework for managing table definition and structure changes which ultimately aims at empowering the user with richer meta-data. This can be particularly helpful for end-users by providing helpful hints on missing values which inherently result from data structure changes.;Once the systems were integrated within our data warehouse environment, the data was made available to users either in the form of reports using various data formats (i.e. web report, MS-Excel, etc.) or OLAP analysis. Eventually, once the data warehouse shall have sufficiently evolved, future plans are underway to make data mining technology available to end-users.;One of the key challenges that has arisen during our research is how one should manage the evolution and changes to the table structures within our operational systems (OLTP), in order to minimize the impact on the data warehouse. By "evolution" we imply any addition, modification or removal of a table field. We have had to consider how to present to the end-user all these possible table structure changes and how they relate to the presence of missing values within the data warehouse. The objective has been for such biases or anomalies to be properly managed and have a transparent impact on the business end-user. Another difficulty to manage has been how to inform the end-user of events which could have occurred that can have a direct impact on the business value of the data available for a given table field. For instance, the occurrence of a strike by professors at a university could superficially increase the student failure rate. Without such key information, a business analyst would have no means for properly correcting such a situation and correctly interpreting the associated data with such an event (i.e. biased average and standard deviation measures).;During the most demanding phase of the data preparation stage, a major integration problem was observed. The legacy systems did not make use of foreign key constraints and this had a direct impact on the integrity of the data that was migrated to the data warehouse (i.e. relation model to dimensional model transformation). As such, two tools have been implemented in order to manage the integrity and extensibility of table definitions: a data analysis tool and a data definition language (DDL) analysis tool.;Though the implementation of this project shall be simplified by the use of such tools, new challenges shall arise. For instance, it shall become important to consider the "intelligent" management of large masses of data by making use of archiving and re-integration techniques in order to potentially satisfy many business analysis requirements from end-users.
机译:以下项目包括设计和实现用于实现针对大学业务环境的数据仓库的方法。这项工作的主要目标是通过使用仪表板技术来增强决策者的能力。更具体地说,该项目基于理论和特别强的实践背景,特别是在更好地管理大学生客户群的背景下进行的。我们项目的早期阶段包括评估各种数据仓库设计方法。实际上,我们开发的方法是基于Ralph Kimbal的方法,该方法基于实现端到端业务流程和面向主题的维度模型的原理。因此,我们将在本文中提出的设计方法旨在鼓励可扩展性,因为该项目将继续发展,并且新的异构数据源可用于最终用户。随后,对现有的商业和开源数据仓库以及商业智能产品进行了详细的分析,目的是最终在UQTR上推荐一个合适的解决方案以满足商业需求。更具体地说,该项目认为需要两种工具:后端提取,转换和加载工具(ETC)和用于通过Web向最终用户呈现数据和信息的前端商业智能框架。最后,我们提出了一个用于管理表定义和结构更改的框架,该框架最终旨在为用户提供更丰富的元数据。通过提供有关数据结构变化固有的缺失值的有用提示,这对最终用户特别有用。一旦将这些系统集成到我们的数据仓库环境中,就可以使用以下两种形式以报告的形式向用户提供数据:各种数据格式(例如Web报告,MS-Excel等)或OLAP分析。最终,一旦数据仓库得到充分的发展,未来的计划就会在进行中,以使数据挖掘技术可供最终用户使用。;在我们的研究过程中出现的主要挑战之一是如何应对表的发展和变化我们的操作系统(OLTP)中的结构,以最大程度地减少对数据仓库的影响。通过“演变”,我们暗示对表字段进行任何添加,修改或删除。我们不得不考虑如何向最终用户呈现所有这些可能的表结构更改,以及它们如何与数据仓库中缺失值的存在相关联。目的是要适当地管理此类偏差或异常,并对业务最终用户产生透明影响。管理的另一个困难是如何通知最终用户可能发生的事件,这些事件可能直接影响给定表字段的可用数据的业务价值。例如,大学教授罢工的发生可能从表面上增加学生的失败率。没有此类关键信息,业务分析师将无法正确纠正这种情况并通过此类事件正确解释相关数据(即偏差平均和标准偏差度量值)。在数据准备阶段最苛刻的阶段,观察到一个主要的集成问题。遗留系统没有使用外键约束,这直接影响了迁移到数据仓库的数据的完整性(即关系模型到维模型转换)。这样,为了管理表定义的完整性和可扩展性,已经实现了两个工具:一个数据分析工具和一个数据定义语言(DDL)分析工具。尽管应使用该工具简化该项目的实施工具,将会出现新的挑战。例如,重要的是要考虑使用归档和重新集成技术来对海量数据进行“智能”管理,以便潜在地满足最终用户的许多业务分析要求。

著录项

  • 作者

    Denis, Marie-Chantal.;

  • 作者单位

    Universite du Quebec a Trois-Rivieres (Canada).;

  • 授予单位 Universite du Quebec a Trois-Rivieres (Canada).;
  • 学科 Mathematics.;Information Science.
  • 学位 M.Sc.
  • 年度 2008
  • 页码 170 p.
  • 总页数 170
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 数学;信息与知识传播;
  • 关键词

  • 入库时间 2022-08-17 11:38:58

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号