首页> 外文OA文献 >Some Inference Problems in Clustered (Longitudinal) Count Data with Over-dispersion
【2h】

Some Inference Problems in Clustered (Longitudinal) Count Data with Over-dispersion

机译:过度分散的聚类(纵向)计数数据中的一些推理问题

摘要

Clustered (includes longitudinal) count data arise in many bio-statistical practices in which a number of repeated responses are observed over time from a number of individuals. One important problem that arises in practice is to test homogeneity within clusters (individuals) and between clusters (individuals). As data within clusters are observations of repeated responses, the count data may be correlated and/or over-dispersed. Jacqmin-Gadda and Commenges (1995) derive a score test statistic H_S by assuming a random intercept model within the framework of the generalized linear mixed model by obtaining exact variance of the likelihood score under the null hypothesis of homogeneity and a score test statistic H_T using the generalized estimating equation (GEE) approach (Liang and Zeger, 1986; Zeger and Liang, 1986). They further show that the two tests are identical when the covariance matrix assumed in the GEE approach is that of the random-effects model. In each of these cases they deal with (a) the situation in which the dispersion parameter $phi$ is assumed to be known and (b) the situation in which the dispersion parameter $phi$ is assumed to be unknown. The second situation, however, is more realistic as $phi$ will be unknown in practice. For over-dispersed count data with unknown over-dispersion parameter we use the score test procedure of Rao (1947) and derive three tests by assuming a random intercept model within the framework of (i) the over-dispersed generalized linear model (ii) the negative binomial model, and (iii) the double extended quasi likelihood model (Lee and Nelder, 2001). All these three statistics are much simpler than the statistic obtained from the statistic $H_S$ derived by Jacqmin-Gadda and Commenges (1995) under the framework of the over-dispersed generalized linear mixed effects model. The second statistic takes the over-dispersion more directly into the model and therefore is expected to do well when the model assumptions are satisfied and the other statistics are expected to be robust. Simulations show superior level property of the statistics derived under the negative binomial and double extended quasi-likelihood model assumptions. Further, two score tests have been developed to test for over-dispersion in the generalized linear mixed model. The four score tests of homogeneity and the two score tests for detecting over-dispersion are applied to two real life data examples. A plan for future study is given.
机译:聚集(包括纵向)计数数据出现在许多生物统计学实践中,其中随着时间的推移,许多个人观察到许多重复的响应。在实践中出现的一个重要问题是测试集群内部(个体)和集群之间(个体)的同质性。由于簇内的数据是重复响应的观察结果,因此计数数据可能是相关的和/或过度分散的。 Jacqmin-Gadda和Commenges(1995)通过在广义线性混合模型的框架内假设随机截距模型来获得得分检验统计量H_S,方法是在同质性零假设下获得似然得分的精确方差,并使用广义估计方程(GEE)方法(Liang和Zeger,1986; Zeger和Liang,1986)。他们进一步表明,当GEE方法中假设的协方差矩阵是随机效应模型的协方差矩阵时,这两个检验是相同的。在这些情况的每一个中,他们处理(a)假定色散参数$ phi $为已知的情况,以及(b)假定色散参数$ phi $为未知的情况。但是,第二种情况更为现实,因为$ phi $在实践中是未知的。对于具有未知超分散参数的超分散计数数据,我们使用Rao(1947)的得分测试程序,并通过假设(i)超分散广义线性模型(ii)框架内的随机拦截模型来推导三个测试负二项式模型,以及(iii)双重扩展拟似然模型(Lee and Nelder,2001)。所有这三个统计量都比在过度分散的广义线性混合效应模型的框架下从Jacqmin-Gadda和Commenges(1995)导出的统计量$ H_S $获得的统计量简单得多。第二个统计量将过度分散直接带入模型,因此,在满足模型假设且其他统计量预期可靠的情况下,预期效果很好。仿真表明,在负二项式和双重扩展拟似然模型假设下得出的统计量具有较高的水平性质。此外,已经开发了两个得分测试来测试广义线性混合模型中的过度分散。将四个同质性得分测试和两个用于检测过度分散的得分测试应用于两个现实数据示例。给出了未来研究的计划。

著录项

  • 作者

    Azad Kazi;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号