【24h】

Data Mining Using Unguided Symbolic Regression on a Blast Furnace Dataset

机译:高炉数据集上使用非引导符号回归的数据挖掘

获取原文

摘要

In this paper a data mining approach for variable selection and knowledge extraction from datasets is presented. The approach is based on unguided symbolic regression (every variable present in the dataset is treated as the target variable in multiple regression runs) and a novel variable relevance metric for genetic programming. The relevance of each input variable is calculated and a model approximating the target variable is created. The genetic programming configurations with different target variables are executed multiple times to reduce stochastic effects and the aggregated results are displayed as a variable interaction network. This interaction network highlights important system components and implicit relations between the variables. The whole approach is tested on a blast furnace dataset, because of the complexity of the blast furnace and the many interrelations between the variables. Finally the achieved results are discussed with respect to existing knowledge about the blast furnace process.
机译:本文提出了一种从数据集中进行变量选择和知识提取的数据挖掘方法。该方法基于无指导的符号回归(数据集中存在的每个变量在多次回归运行中均被视为目标变量)和用于遗传编程的新型变量相关性度量。计算每个输入变量的相关性,并创建一个近似目标变量的模型。多次执行具有不同目标变量的遗传编程配置,以减少随机影响,并且将汇总结果显示为变量交互网络。该交互网络突出了重要的系统组件和变量之间的隐式关系。由于高炉的复杂性以及变量之间的许多相互关系,因此在高炉数据集上测试了整个方法。最后,关于高炉工艺的现有知识讨论了获得的结果。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号