首页> 外文会议>IEEE International Conference on Automation Science and Engineering >A Self-organizing Method for Predictive Modeling with Highly-redundant Variables
【24h】

A Self-organizing Method for Predictive Modeling with Highly-redundant Variables

机译:一种具有高冗余变量的预测建模的自组织方法

获取原文

摘要

Rapid advancement of sensing and information technology brings the big data, which presents a gold mine of the 21st century. However, big data also brings significant challenges for data-driven decision making. In particular, it is not uncommon that a large number of variables (or features) underlie the big data. Complex interdependence structures among variables challenge the traditional framework of predictive modeling. This paper presents a new methodology of self-organizing network for variable clustering and predictive modeling. Specifically, we developed a new approach, namely nonlinear coupling analysis to measure nonlinear interdependence structures among variables. Further, all the variables are embedded as nodes in a complex network. Nonlinear-coupling forces move these nodes to derive a self-organizing topology of network. As such, variables are clustered as sub-network communities in the space. Experimental results demonstrated that the proposed methodology not only outperforms traditional variable clustering algorithms such as hierarchical clustering and oblique principal component analysis, but also effectively identify interdependent structures among variables and further improves the performance of predictive modeling. The proposed new idea of self-organizing network is generally applicable for predictive modeling in many disciplines that involve a large number of highly-redundant variables.
机译:传感和信息技术的快速发展带来的大数据,其中介绍了21世纪的金矿。然而,大数据也带来了数据驱动的决策显著的挑战。特别是,它的情况并不少见,大量的变量(或功能)背后的大数据。变量之间的相互依存关系的复杂挑战的结构预测模型的传统框架。本文呈现的自组织网络的新方法变量聚类和预测建模。具体来说,我们开发了一种新的方法,即非线性耦合分析测量变量之间的非线性相互依存的结构。此外,所有的变量都嵌入作为在复杂的网络节点。非线性耦合力将这些节点以获得网络的自组织拓扑。因此,变量都聚集在空间的子网络社区。实验结果表明,所提出的方法不仅优于传统的可变的聚类算法,例如等级聚类和倾斜主成分分析,但也有效地识别变量间相互依存的结构和进一步提高预测建模的性能。自组织网络提出的新想法是普遍适用于涉及大量高度冗余变量的许多学科的预测建模。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号