首页> 外文期刊>Expert Systems with Application >A framework for data transformation in Credit Behavioral Scoring applications based on Model Driven Development
【24h】

A framework for data transformation in Credit Behavioral Scoring applications based on Model Driven Development

机译:基于模型驱动开发的信用行为评分应用程序中的数据转换框架

获取原文
获取原文并翻译 | 示例
       

摘要

The preprocessing stage in knowledge discovery projects is costly, normally taking between 50% and 80% of the total project time. It is in this stage that data in a relational database are transformed for applying a data mining technique. This stage is a complex task that demands from database designers a strong interaction with experts having a broad knowledge about the application domain. Frameworks aiming to systemize this stage have significant limitations when applied to Credit Behavioral Scoring solutions. This paper proposes a framework based on the Model Driven Development approach to systemize the mentioned stage. This work has three main contributions: 1) improving the discriminant power of data mining techniques by means of the construction of new input variables which embed temporal knowledge for the technique; 2) reducing the time of data transformation using automatic code generation, and 3) allowing artificial intelligence and statistics modelers to perform the data transformation without the help of database experts. In order to validate the proposed framework, two comparative studies were conducted. Experiments showed that the proposed framework delivers a performance equivalent or superior to those of existing frameworks and reduces the time of data transformation with a confidence level of 95%. (C) 2016 Elsevier Ltd. All rights reserved.
机译:知识发现项目的预处理阶段成本很高,通常占项目总时间的50%至80%。正是在这个阶段,关系数据库中的数据被转换以应用数据挖掘技术。此阶段是一项复杂的任务,需要数据库设计人员与对应用程序领域有广泛了解的专家进行强有力的互动。旨在将这一阶段系统化的框架在应用于“信用行为评分”解决方案时会受到重大限制。本文提出了一个基于模型驱动开发方法的框架来系统化上述阶段。这项工作有三个主要贡献:1)通过构造新的输入变量来提高数据挖掘技术的判别能力,这些新输入变量嵌入了该技术的时间知识; 2)使用自动代码生成减少数据转换的时间,并且3)允许人工智能和统计建模人员在无需数据库专家帮助的情况下执行数据转换。为了验证提议的框架,进行了两项比较研究。实验表明,提出的框架可提供与现有框架相同或更高的性能,并以95%的置信度缩短数据转换时间。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号