Subset selection for multiple linear regression via optimization

Park Young Woong; Klabjan Diego

首页> 外文期刊>Journal of Global Optimization >Subset selection for multiple linear regression via optimization

【24h】

Subset selection for multiple linear regression via optimization

机译：通过优化的多个线性回归的子集选择

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Subset selection in multiple linear regression aims to choose a subset of candidate explanatory variables that tradeoff fitting error (explanatory power) and model complexity (number of variables selected). We build mathematical programming models for regression subset selection based on mean square and absolute errors, and minimal-redundancy-maximal-relevance criteria. The proposed models are tested using a linear-program-based branch-and-bound algorithm with tailored valid inequalities and big M values and are compared against the algorithms in the literature. For high dimensional cases, an iterative heuristic algorithm is proposed based on the mathematical programming models and a core set concept, and a randomized version of the algorithm is derived to guarantee convergence to the global optimum. From the computational experiments, we find that our models quickly find a quality solution while the rest of the time is spent to prove optimality; the iterative algorithms find solutions in a relatively short time and are competitive compared to state-of-the-art algorithms; using ad-hoc big M values is not recommended.

机译：多个线性回归中的子集选择旨在选择权衡拟合误差（解释性电源）和模型复杂性（所选变量数）的候选解释性变量的子集。我们基于均线和绝对误差和最小冗余最大关联标准来构建用于回归子集选择的数学编程模型。使用基于线性程序的分支和绑定算法测试所提出的模型，具有量身定制的有效不等式和大M值，并与文献中的算法进行比较。对于高维例，基于数学编程模型和核心集合概念提出了一种迭代启发式算法，并且导出了算法的随机版本，以保证到全局最优的融合。从计算实验中，我们发现我们的模型很快找到了质量解决方案，而其余时间则花费了证明了最优性;迭代算法在相对较短的时间内找到解决方案，与最先进的算法相比是竞争力的;不建议使用ad-hoc大m值。

著录项

来源
《Journal of Global Optimization》 |2020年第3期|543-574|共32页
作者
Park Young Woong; Klabjan Diego;
展开▼
作者单位

Iowa State Univ Ivy Coll Business Ames IA 50011 USA;

Northwestern Univ Dept Ind Engn & Management Sci Evanston IL 60208 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multiple linear regression; Subset selection; High dimensional data; Mathematical programming; Linearization;

机译：多线性回归;子集选择;高维数据;数学编程;线性化;

相似文献

外文文献
中文文献
专利

1. An efficient optimization approach for best subset selection in linear regression, with application to model selection and fitting in autoregressive time-series [J] . Di Gangi Leonardo, Lapucci M., Schoen F., Computational optimization and applications . 2019,第3期

机译：一种有效的优化方法，用于最佳的Linear回归中选择的优化方法，应用于自回归时间序列中的选择和拟合
2. Subset selection in multiple linear regression in the presence of outlier and multicollinearity [J] . Nileshkumar H.Jadhav, Dattatraya N. Kashid, Subhash R. Kulkarni Statistical Methodology . 2014,第JULa期

机译：存在离群和多重共线性的多元线性回归中的子集选择
3. Subset selection in multiple linear regression models: A hybrid of genetic and simulated annealing algorithms [J] . Hasan ?rkcü H. Applied mathematics and computation . 2013,第23期

机译：多个线性回归模型中的子集选择：遗传和模拟退火算法的混合
4. A Local-branching Heuristic for the Best Subset Selection Problem in Linear Regression [C] . Tamara Bigler, Oliver Strub IEEE International Conference on Industrial Engineering and Engineering Management . 2018

机译：线性回归中最佳子集选择问题的局部分支启发法
5. A COMPARISON OF SIX MODELS FOR PREDICTING CORPORATE BANKRUPTCY: MULTIPLE LINEAR REGRESSION ANALYSIS, MULTIPLE LINEAR DISCRIMINANT ANALYSIS, STEPWISE REGRESSION ANALYSIS, STEPWISE DISCRIMINANT ANALYSIS, MULTIPLE LINEAR REGRESSION ANALYSIS WITH RIDGE REGRESSION, AND MULTIPLE LINEAR DISCRIMINANT ANALYSIS WITH BIASED MINIMUM CHI-SQUARE RULE [D] . MAPP, JOHNNIE ALBERT. 1981

机译：六种预测公司破产的模型的比较：多个线性回归分析，多个线性判别分析，逐步回归分析，逐步判别分析，多个带岭点回归的线性回归分析，以及多个线性离散
6. Comparison of subset selection methods in linear regression in the context of health-related quality of life and substance abuse in Russia [O] . Olga Morozova, Olga Levina, Anneli Uusküla, 2015

机译：与健康相关的生活质量和药物滥用在俄罗斯进行线性回归的子集选择方法的比较
7. Subset selection for multiple linear regression via optimization [O] . Young Woong Park, Diego Klabjan 2020

机译：通过优化的多个线性回归的子集选择

Subset selection for multiple linear regression via optimization

摘要

著录项

相似文献

相关主题

期刊订阅