首页> 美国卫生研究院文献>Environmental Health Perspectives >Methodological problems arising from the choice of an independent variable in linear regression with application to an air pollution epidemiological study.
【2h】

Methodological problems arising from the choice of an independent variable in linear regression with application to an air pollution epidemiological study.

机译:在线性回归中选择自变量引起的方法学问题并应用于空气污染流行病学研究。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In epidemiological studies using linear regression, it is often necessary for reasons of economy or unavailability of data to use as the independent variable not the variable ideally demanded by the hypothesis under study but some convenient practical approximation to it. We show that if the correlation coefficient between the "practical" and "ideal" variables can be obtained, then a range of uncertainty can be obtained within which the desired regression coefficient of dependent on "ideal" variable may lie. This range can be quite wide, even if the practical and ideal variables are fairly well correlated. These points are illustrated with data on observed regression coefficients from an air pollution epidemiological study, in which pollution measured at one station in a large metropolitan area (containing 40 aerometric stations) was used as the practical approximation to the city-wide average pollution. The uncertainties in the regression coefficients were found to exceed the regression coefficients themselves by large factors. The problem is one that may afflict application of linear regression in general, and suggests caution when selecting independent variables for regression analysis on the basis of convenience, rather than relevance to the hypotheses tested.
机译:在使用线性回归的流行病学研究中,出于经济原因或数据不可用的原因,经常有必要将其用作自变量,而不是所研究假设理想地要求的变量,而应将其用作一些方便的实际近似值。我们表明,如果可以获得“实际”和“理想”变量之间的相关系数,那么就可以获得不确定性范围,其中取决于“理想”变量的期望回归系数可能位于其中。即使实际变量和理想变量之间具有很好的相关性,该范围也可能很大。这些点用空气污染流行病学研究中观察到的回归系数数据进行了说明,在该研究中,在大都市区(包含40个航空气象站)的一个站点测得的污染被用作对全市平均污染的实际近似值。发现回归系数的不确定性超出了大因素本身的回归系数。这个问题通常可能会影响线性回归的应用,并建议在基于方便性而不是与检验假设相关的基础上选择自变量进行回归分析时要谨慎。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号