首页> 外文会议>Basic Science International Conference >Unbiased Risk and Cross-Validation Method for Selecting Optimal Knots in Multivariable Nonparametric Regression Spline Truncated (Case Study: Unemployment Rate in Central Java, Indonesia, 2015)
【24h】

Unbiased Risk and Cross-Validation Method for Selecting Optimal Knots in Multivariable Nonparametric Regression Spline Truncated (Case Study: Unemployment Rate in Central Java, Indonesia, 2015)

机译:用于在多变量非参数回归样条上选择最佳结的无偏见风险和交叉验证方法(案例研究:中爪哇省,印度尼西亚的失业率,2015)

获取原文

摘要

Nonparametric regression gives better flexibility because the form of the regression curve estimation adjusts to the data. One nonparametric regression method is spline truncation. The number of knots and their locations affect the form of this regression curve estimation. The optimal knot is needed in order to obtain the best model. There are methods to select optimal knots, such as unbiased risk (UBR) and cross-validation (C V). This paper discusses UBR and CV, then, using both simulated data and the unemployment rate data of Central Java Province, Indonesia, in 2015, compares UBR and CV for selecting the optimal knots. The criteria for selecting the best model were based on Mean Squared Error and R-square. The simulation was performed on a spline truncated function with error generated from normal distribution for varied sample sizes and error variance. The results of the simulation study showed that CV estimates the knots more accurately than UBR. From the application to unemployment rate data, the optimal knot by using C V was a combination of 2-3-2-1-3 knot with MSE of 0.3946 and R-square of 93.047%. Meanwhile, by using UBR, the optimal knot was a three knot with MSE of 0.6865 and R-square of 90.59%. In conclusion, from the results of simulation data and application to unemployment rate data, the C V method generated a better model than the UBR method.
机译:非参数回归提供更好的灵活性,因为回归曲线估计的形式调整为数据。一个非参数回归方法是样条截断。结的数量及其位置影响了该回归曲线估计的形式。为了获得最佳模型,需要最佳结。有方法选择最佳结,例如无偏见的风险(UBR)和交叉验证(C V)。本文讨论了UBR和CV,然后,在2015年,使用模拟数据和中央爪哇省中央的失业率数据,比较了UBR和简历选择最佳结。选择最佳模型的标准基于均方误差和R范围。在曲线截断函数上执行模拟,其具有从正常分布生成的误差,用于各种样本尺寸和误差方差。仿真研究的结果表明,CV比UBR更精确地估计结。从申请到失业率数据,使用C V的最佳结是2-3-2-1-3结的组合,MSE为0.3946,R-Square为93.047%。同时,通过使用UBR,最佳结是三个结,MSE为0.6865,R-Square为90.59%。总之,从模拟数据的结果和应用于失业率数据,C V方法产生了比UBR方法更好的模型。

著录项

相似文献

  • 外文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号