Simulated Data for Linear Regression with Structured and Sparse Penalties: Introducing pylearn-simulate

Tommy L?fstedt; Vincent Guillemot; Vincent Frouin; Edouard Duchesnay; Fouad Hadj-Selem

首页> 外文期刊>Journal of Statistical Software >Simulated Data for Linear Regression with Structured and Sparse Penalties: Introducing pylearn-simulate

【24h】

Simulated Data for Linear Regression with Structured and Sparse Penalties: Introducing pylearn-simulate

机译：具有结构罚则和稀疏罚则的线性回归的模拟数据：pylearn-simulate介绍

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A currently very active field of research is how to incorporate structure and prior knowledge in machine learning methods. It has lead to numerous developments in the field of non-smooth convex minimization. With recently developed methods it is possible to perform an analysis in which the computed model can be linked to a given structure of the data and simultaneously do variable selection to find a few important features in the data. However, there is still no way to unambiguously simulate data to test proposed algorithms, since the exact solutions to such problems are unknown. The main aim of this paper is to present a theoretical framework for generating simulated data. These simulated data are appropriate when comparing optimization algorithms in the context of linear regression problems with sparse and structured penalties. Additionally, this approach allows the user to control the signal-to-noise ratio, the correlation structure of the data and the optimization problem to which they are the solution. The traditional approach is to simulate random data without taking into account the actual model that will be fit to the data. But when using such an approach it is not possible to know the exact solution of the underlying optimization problem. With our contribution, it is possible to know the exact theoretical solution of a penalized linear regression problem, and it is thus possible to compare algorithms without the need to use, e.g., cross-validation. We also present our implementation, the Python package pylearn-simulate, available at https://github.comeurospin/pylearn-simulate and released under the BSD 3clause license. We describe the package and give examples at the end of the paper.

机译：当前非常活跃的研究领域是如何将结构和先验知识整合到机器学习方法中。它导致了非光滑凸最小化领域的众多发展。使用最近开发的方法，可以执行分析，其中可以将计算的模型链接到数据的给定结构，并同时进行变量选择以在数据中找到一些重要特征。但是，仍然没有办法明确地模拟数据来测试所提出的算法，因为未知此类问题的确切解决方案。本文的主要目的是提供一个用于生成模拟数据的理论框架。当在线性回归问题与稀疏和结构罚分的情况下比较优化算法时，这些模拟数据是合适的。另外，这种方法允许用户控制信噪比，数据的相关结构以及它们所要解决的优化问题。传统方法是模拟随机数据，而不考虑适合该数据的实际模型。但是，当使用这种方法时，不可能知道底层优化问题的确切解决方案。利用我们的贡献，有可能知道惩罚线性回归问题的确切理论解，因此可以比较算法而无需使用例如交叉验证。我们还介绍了我们的实现，即Python包pylearn-simulate，可从https://github.comeurospin/pylearn-simulate获得，并已获得BSD 3clause许可。我们在本文末尾描述了该软件包并给出了示例。

著录项

来源
《Journal of Statistical Software》 |2018年第1期|共33页
作者
Tommy L?fstedt; Vincent Guillemot; Vincent Frouin; Edouard Duchesnay; Fouad Hadj-Selem;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Cross validation in sparse linear regression with piecewise continuous nonconvex penalties and its acceleration [J] . Obuchi Tomoyuki, Sakata Ayaka Journal of physics, A. Mathematical and theoretical . 2019,第41期

机译：稀疏线性回归中的交叉验证与分段连续的非渗透惩罚及其加速度
2. Interpretable linear and nonlinear quantitative structure-selectivity relationship (QSSR) modeling of a biomimetic catalytic system by particle swarm optimization based sparse regression [J] . Xu Lu, Fu Hai-Yan, Yin Qiao-Bo, Chemometrics and Intelligent Laboratory Systems . 2016,第Null期

机译：仿生催化系统基于稀疏回归的可解释线性和非线性定量结构-选择性关系（QSSR）建模
3. Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio [J] . Fevotte C., Torresani B., Daudet L., IEEE transactions on audio, speech and language processing . 2008,第1期

机译：结构化先验的稀疏线性回归及其在音乐音频去噪中的应用
4. A linear programming approach to sparse linear regression with quantized data [C] . V. Cerone, S. M. Fosson, D. Regruto Annual American Control Conference . 2019

机译：使用量化数据进行稀疏线性回归的线性规划方法
5. Variable screening and model selection in censored quantile regression via sparse penalties and stepwise refinement [D] . Gan, Lu 2014

机译：通过稀疏惩罚和逐步细化在可审查分位数回归中进行变量筛选和模型选择
6. Structured penalties for functional linear models—partially empirical eigenvectors for regression [O] . Timothy W. Randolph, Jaroslaw Harezlak, Ziding Feng -1

机译：结构化的惩罚为回归功能线性模型-部分经验特征向量
7. An Additive Sparse Penalty for Variable Selection in High-Dimensional Linear Regression Model [O] . Sangin Lee 2015

机译：高维线性回归模型中可变选择的添加剂稀疏罚分
8. Design of data-structure-neutral libraries for the iterative solution of sparse linear systems [R] . Smith, B. F., Gropp, W. D. 1993

机译：用于稀疏线性系统迭代求解的数据结构中立库设计

Simulated Data for Linear Regression with Structured and Sparse Penalties: Introducing pylearn-simulate

摘要

著录项

相似文献

相关主题

期刊订阅