首页> 外文期刊>International journal of systems assurance engineering and management >Empirical analysis of metrics for object oriented multidimensional model of data warehouse using unsupervised machine learning techniques
【24h】

Empirical analysis of metrics for object oriented multidimensional model of data warehouse using unsupervised machine learning techniques

机译:使用无监督机器学习技术的数据仓库面向对象多维模型指标的实证分析

获取原文
获取原文并翻译 | 示例
           

摘要

Data Warehouse provides the foundation for businesses to take informed decisions for day to day operations and making future strategy. Since the role is so pivotal to the growth and success of the business, its quality is very critical. Conceptual models of data warehouses give us a great insight into the quality of the developed system during the early stages of the design process. Researchers have proposed a number of metrics to evaluate the quality of these object oriented multidimensional models. Further, for these metrics to be used in practice, empirical evaluation is crucial. There are a number of propositions in literature that work towards empirical validation of metrics. But most of them are either restricted to statistical techniques or supervised machine learning techniques. In order to empirically validate the metrics, we need to get user responses for a number of schemas and take down observations to quantify model quality aspects like understand-ability, efficiency etc. This can result in personal biases, errors and random outliers which impacts the evaluation model. In this paper, we have made a first attempt to assess the relationship between the object oriented multidimensional data warehouse structural metrics and understandability of its models by using unsupervised machine learning techniques with the aid of a data warehouse quality expert. The results indicate that the proposed metrics have a strong relationship with understandability and intum quality of the data warehouse conceptual models and the unsupervised techniques are able to identify this relationship with high degree of accuracy.
机译:数据仓库为企业为日常运营和制定未来战略做出明智的决策提供了基础。由于角色对于企业的成长和成功至关重要,因此其质量至关重要。数据仓库的概念模型使我们可以在设计过程的早期阶段深入了解已开发系统的质量。研究人员提出了许多衡量标准,以评估这些面向对象的多维模型的质量。此外,对于要在实践中使用这些指标,经验评估至关重要。文献中有许多主张可以对度量进行经验验证。但是它们中的大多数要么局限于统计技术,要么限于受监督的机器学习技术。为了凭经验验证指标,我们需要获得用户对多种模式的响应,并进行观察以量化模型质量方面的内容,例如可理解性,效率等。这可能会导致个人偏见,错误和随机离群值,从而影响评价模型。在本文中,我们首次尝试通过在数据仓库质量专家的帮助下使用无监督的机器学习技术来评估面向对象的多维数据仓库结构度量与其模型的可理解性之间的关系。结果表明,所提出的度量标准与数据仓库概念模型的可理解性和intum质量有很强的关系,而无监督技术则可以高度准确地识别这种关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号