首页> 美国政府科技报告 >The Influence of Selected Factors on Shrinkage and Overfit in Multiple Correlation
【24h】

The Influence of Selected Factors on Shrinkage and Overfit in Multiple Correlation

机译:选择因素对多重相关收缩过盈的影响

获取原文

摘要

Weighting of variables in a regression equation so as to maximize prediction of a criterion presents several problems. Optimal weighting in the sample case means that chance-related error is also weighted indiscriminately. Because such error will not relate to the criterion in subsequent samples, a sample multiple correlation (R) will be on the average larger than the population value (overfit), and its value on cross-validation will be lower than in the quation-development sample (shrinkage). The influence of characteristics of the population and other conditions of the sampling situation on the outcome and stability of the regression equation has not been well understood. In particular, the role played by the relationship of initial predictor set size (M) to sample size (N) has not received adequate attention. The report attempted to examine and isolate the role of sampling error in the magnitude and stability of sample multiple R values obtained by incremental test selection techniques. The effect of selected factors on the impact of sampling error was examined. Three proposed shrinkage estimation formulas were evaluated for effectiveness, and a search was conducted for more efficient formulas incorporating the M/N ratio. Method of controlling shrinkage and overfit were discussed and evaluated. (Author)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号