首页> 外文会议>International Conference on Intelligent Data Analysis >Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery
【24h】

Sampling of Highly Correlated Data for Polynomial Regression and Model Discovery

机译:多项式回归和模型发现的高度相关数据的抽样

获取原文

摘要

The usual way of conducting empirical comparisons among competing polynomial model selection criteria is by generating artificial data from created true models with specified link weights. The robustness of each model selection criterion is then judged by its ability to recover the true model from its sample data sets with varying sizes and degrees of noise. If we have a set of multivariate real data and have empirically found a polynomial regression model that is so far seen as the right model represented by the data, we would like to be able to replicate the multivariate data artificially to enable us to run multiple experiments to achieve two objectives. First, to see if the model selection criteria can recover the model that is seen to be the right model. Second, to find out the minimum sample size required to recover the right model. This paper proposes a methodology to replicate real multivariate data using its covariance matrix and a polynomial regression model seen as the right model represented by the data. The sample data sets generated are then used for model discovery experiments.
机译:在竞争多项式选择标准中进行实证比较的通常方法是通过从具有指定链接权重的创建的真实模型生成人工数据。然后通过其能够从其样本数据集恢复真实模型的能力与具有不同大小和噪声程度的能力来判断每个模型选择标准的鲁棒性。如果我们有一组多变量真实数据并且已经经验发现了一个远远被视为数据所代表的右模型的多项式回归模型,我们希望能够人工复制多变量数据,使我们能够运行多个实验实现两个目标。首先,要查看模型选择标准是否可以恢复被视为正确模型的模型。其次,找出恢复右模型所需的最小示例大小。本文提出了一种使用其协方差矩阵复制真实多变量数据的方法,以及作为数据表示的正确模型的多项式回归模型。然后生成的样本数据集用于模型发现实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号