首页> 外文会议>IEEE International Conference on Automation Science and Engineering >A self-organizing method for predictive modeling with highly-redundant variables
【24h】

A self-organizing method for predictive modeling with highly-redundant variables

机译:具有高度冗余变量的自组织预测模型的方法

获取原文

摘要

Rapid advancement of sensing and information technology brings the big data, which presents a gold mine of the 21st century. However, big data also brings significant challenges for data-driven decision making. In particular, it is not uncommon that a large number of variables (or features) underlie the big data. Complex interdependence structures among variables challenge the traditional framework of predictive modeling. This paper presents a new methodology of self-organizing network for variable clustering and predictive modeling. Specifically, we developed a new approach, namely nonlinear coupling analysis to measure nonlinear interdependence structures among variables. Further, all the variables are embedded as nodes in a complex network. Nonlinear-coupling forces move these nodes to derive a self-organizing topology of network. As such, variables are clustered as sub-network communities in the space. Experimental results demonstrated that the proposed methodology not only outperforms traditional variable clustering algorithms such as hierarchical clustering and oblique principal component analysis, but also effectively identify interdependent structures among variables and further improves the performance of predictive modeling. The proposed new idea of self-organizing network is generally applicable for predictive modeling in many disciplines that involve a large number of highly-redundant variables.
机译:传感和信息技术的飞速发展带来了大数据,这代表了21世纪的金矿。但是,大数据也给以数据为依据的决策带来了巨大挑战。特别是,大数据背后存在大量变量(或特征)的情况并不少见。变量之间复杂的相互依存结构挑战了传统的预测建模框架。本文提出了一种用于变量聚类和预测建模的自组织网络新方法。具体来说,我们开发了一种新方法,即非线性耦合分析,以测量变量之间的非线性相互依存结构。此外,所有变量都作为复杂网络中的节点嵌入。非线性耦合力使这些节点移动,以得出网络的自组织拓扑。这样,变量被聚集为该空间中的子网社区。实验结果表明,所提出的方法不仅优于传统的变量聚类算法,例如层次聚类和倾斜主成分分析,而且可以有效地识别变量之间的相互依存结构,从而进一步提高了预测建模的性能。提出的自组织网络新思想通常适用于涉及大量高度冗余变量的许多学科中的预测建模。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号