A Self-organizing Method for Predictive Modeling with Highly-redundant Variables

机译：一种具有高冗余变量的预测建模的自组织方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Rapid advancement of sensing and information technology brings the big data, which presents a gold mine of the 21st century. However, big data also brings significant challenges for data-driven decision making. In particular, it is not uncommon that a large number of variables (or features) underlie the big data. Complex interdependence structures among variables challenge the traditional framework of predictive modeling. This paper presents a new methodology of self-organizing network for variable clustering and predictive modeling. Specifically, we developed a new approach, namely nonlinear coupling analysis to measure nonlinear interdependence structures among variables. Further, all the variables are embedded as nodes in a complex network. Nonlinear-coupling forces move these nodes to derive a self-organizing topology of network. As such, variables are clustered as sub-network communities in the space. Experimental results demonstrated that the proposed methodology not only outperforms traditional variable clustering algorithms such as hierarchical clustering and oblique principal component analysis, but also effectively identify interdependent structures among variables and further improves the performance of predictive modeling. The proposed new idea of self-organizing network is generally applicable for predictive modeling in many disciplines that involve a large number of highly-redundant variables.

机译：传感和信息技术的快速发展带来的大数据，其中介绍了21世纪的金矿。然而，大数据也带来了数据驱动的决策显著的挑战。特别是，它的情况并不少见，大量的变量（或功能）背后的大数据。变量之间的相互依存关系的复杂挑战的结构预测模型的传统框架。本文呈现的自组织网络的新方法变量聚类和预测建模。具体来说，我们开发了一种新的方法，即非线性耦合分析测量变量之间的非线性相互依存的结构。此外，所有的变量都嵌入作为在复杂的网络节点。非线性耦合力将这些节点以获得网络的自组织拓扑。因此，变量都聚集在空间的子网络社区。实验结果表明，所提出的方法不仅优于传统的可变的聚类算法，例如等级聚类和倾斜主成分分析，但也有效地识别变量间相互依存的结构和进一步提高预测建模的性能。自组织网络提出的新想法是普遍适用于涉及大量高度冗余变量的许多学科的预测建模。

著录项

来源
《IEEE International Conference on Automation Science and Engineering》|2015年||共6页
会议地点
作者
Gang Liu; Hui Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP2-53;
关键词

相似文献

外文文献
中文文献
专利

1. Comparison of variable selection methods in predictive models applied to near-infrared and genomic data [J] . R.A.Ferreira, L.A.Peternelli Genetics and Molecular Research . 2021,第3期

机译：在近红外和基因组数据应用预测模型中的可变选择方法的比较
2. Comparison of variable selection methods for clinical predictive modeling [J] . Sanchez-Pinto L. Nelson, Venable Laura Ruth, Fahrenbach John, International journal of medical informatics . 2018,第AUGa期

机译：用于临床预测建模的变量选择方法的比较
3. Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality. [J] . Austin PC, Tu JV Journal of Clinical Epidemiology . 2004,第11期

机译：用于逻辑回归的自动变量选择方法产生了不稳定的模型，用于预测急性心肌梗死的死亡率。
4. A self-organizing method for predictive modeling with highly-redundant variables [C] . Liu Gang, Yang Hui IEEE International Conference on Automation Science and Engineering . 2015

机译：具有高度冗余变量的自组织预测模型的方法
5. Penalization Methods for Group Identification and Variable Selection in Models with Correlated Predictors. [D] . Sharma, Dhruv Bhushan. 2010

机译：具有相关预测变量的模型中用于组识别和变量选择的惩罚方法。
6. Examining variable selection methods for the predictive performance of regression models and the proportion of selected variables and selected random variables [O] . Hiromasa Kaneko 2021

机译：检查回归模型的预测性能的变量选择方法以及所选变量的比例和选择的随机变量
7. Comparison of variable selection methods for clinical predictive modeling [O] . L. Nelson Sanchez-Pinto, Laura Ruth Venable, John Fahrenbach, 2018

机译：临床预测建模可变选择方法的比较

A Self-organizing Method for Predictive Modeling with Highly-redundant Variables

摘要

著录项

相似文献

相关主题

期刊订阅