首页> 外文会议>International Conference on Intelligent Data Analysis >Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery

【24h】

Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery

机译：多项式回归和模型发现的高度相关数据的抽样

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The usual way of conducting empirical comparisons among competing polynomial model selection criteria is by generating artificial data from created true models with specified link weights. The robustness of each model selection criterion is then judged by its ability to recover the true model from its sample data sets with varying sizes and degrees of noise. If we have a set of multivariate real data and have empirically found a polynomial regression model that is so far seen as the right model represented by the data, we would like to be able to replicate the multivariate data artificially to enable us to run multiple experiments to achieve two objectives. First, to see if the model selection criteria can recover the model that is seen to be the right model. Second, to find out the minimum sample size required to recover the right model. This paper proposes a methodology to replicate real multivariate data using its covariance matrix and a polynomial regression model seen as the right model represented by the data. The sample data sets generated are then used for model discovery experiments.

机译：在竞争多项式选择标准中进行实证比较的通常方法是通过从具有指定链接权重的创建的真实模型生成人工数据。然后通过其能够从其样本数据集恢复真实模型的能力与具有不同大小和噪声程度的能力来判断每个模型选择标准的鲁棒性。如果我们有一组多变量真实数据并且已经经验发现了一个远远被视为数据所代表的右模型的多项式回归模型，我们希望能够人工复制多变量数据，使我们能够运行多个实验实现两个目标。首先，要查看模型选择标准是否可以恢复被视为正确模型的模型。其次，找出恢复右模型所需的最小示例大小。本文提出了一种使用其协方差矩阵复制真实多变量数据的方法，以及作为数据表示的正确模型的多项式回归模型。然后生成的样本数据集用于模型发现实验。

著录项

来源
《International Conference on Intelligent Data Analysis 》|2001年||共8页
会议地点
作者
Grace W. Rumantir; Chris S. Wallace;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Sampling from multivariate normal distribution; Polynomial model discovery; Tropical cyclone intensity forecasting modelling;

机译：从多变量正态分布采样;多项式模型发现;热带气旋强度预测建模;

相似文献

外文文献
中文文献
专利

1. Bayesian analysis of semiparametric Bernstein polynomial regression models for data with sample selection [J] . Kim Hea-Jung, Roh Taeyoung, Choi Taeryon Statistics . 2019 ,第4a6期

机译：具有样本选择的数据的半参数伯恩斯坦多项式回归模型的贝叶斯分析
2. Linear Models for Airborne-Laser-Scanning-Based Operational Forest Inventory With Small Field Sample Size and Highly Correlated LiDAR Data [J] . Junttila Virpi, Kauranne Tuomo, Finley Andrew O., Geoscience and Remote Sensing, IEEE Transactions on . 2015 ,第10期

机译：基于机载激光扫描的经营性森林清单的线性模型，该模型具有较小的现场样本量和高度相关的LiDAR数据
3. Improving extreme wind speed prediction based on a short data sample, using a highly correlated long data sample [J] . Gaidai Oleg, Naess Arvid, Xu Xiaosen, Journal of Wind Engineering and Industrial Aerodynamics: The Journal of the International Association for Wind Engineering . 2019 ,第期

机译：使用高度相关的长数据样本，基于短数据样本提高极端风速预测
4. Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery [C] . Grace W. Rumantir, Chris S. Wallace International Conference on Intelligent Data Analysis . 2001

机译：多项式回归和模型发现的高度相关数据的抽样
5. Goodness -of -fit for logistic regression models developed using data collected from a complex sampling design [D] . Archer, Kellie Jo. 2001

机译：使用从复杂抽样设计中收集的数据开发的逻辑回归模型的拟合优度
6. Unweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation study [O] . Lisa Avery, Nooshin Rotondi, Constance McKnight, 2019

机译：对于受访者驱动的采样数据非加权回归模型的性能优于加权回归技术：模拟研究的结果
7. Figure 6: Boxplots comparing the performance of Partial Least Square Regression (PLSR) and Cubist regression tree models in predicting soil properties using various calibration sampling size and sampling algorithms within the regional dataset. [O] . -1

机译：图6：Boxpots比较了偏最小二乘回归（PLSR）和立体师回归树模型的性能在使用各种校准采样大小和区域数据集中的采样算法预测土壤属性中的性能。

Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery

摘要

著录项

相似文献

相关主题

期刊订阅