【24h】

Efficient regression analysis with ranked-set sampling.

机译:使用排序集抽样进行有效的回归分析。

获取原文
获取原文并翻译 | 示例
           

摘要

Summary. This article is motivated by a lung cancer study where a regression model is involved and the response variable is too expensive to measure but the predictor variable can be measured easily with relatively negligible cost. This situation occurs quite often in medical studies, quantitative genetics, and ecological and environmental studies. In this article, by using the idea of ranked-set sampling (RSS), we develop sampling strategies that can reduce cost and increase efficiency of the regression analysis for the above-mentioned situation. The developed method is applied retrospectively to a lung cancer study. In the lung cancer study, the interest is to investigate the association between smoking status and three biomarkers: polyphenol DNA adducts, micronuclei, and sister chromatic exchanges. Optimal sampling schemes with different optimality criteria such as A-, D-, and integrated mean square error (IMSE)-optimality are considered in the application. With set size 10 in RSS, the improvement of the optimal schemes over simple random sampling (SRS) is great. For instance, by using the optimal scheme with IMSE-optimality, the IMSEs of the estimated regression functions for the three biomarkers are reduced to about half of those incurred by using SRS.
机译:概要。本文是受一项肺癌研究的启发而进行的,该研究涉及回归模型,响应变量过于昂贵以至于无法测量,但预测变量可以很容易地以相对可忽略的成本进行测量。这种情况在医学研究,定量遗传学以及生态和环境研究中经常发生。在本文中,我们使用排序集抽样(RSS)的思想,开发了可以降低成本并提高上述情况的回归分析效率的抽样策略。所开发的方法被追溯应用于肺癌研究。在肺癌研究中,研究兴趣是研究吸烟状况与三种生物标志物之间的关系:多酚DNA加合物,微核和姐妹色交换。应用中考虑了具有不同最佳标准(例如A,D和综合均方误差(IMSE)-最佳性)的最佳采样方案。在RSS中将大小设置为10时,相对于简单随机采样(SRS)而言,最佳方案的改进是很大的。例如,通过使用具有IMSE优化的最佳方案,三个生物标记的估计回归函数的IMSE减少到使用SRS所产生的IMSE的一半。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号