...
首页> 外文期刊>Mathematical Theory and Modeling >Models for Count Data in the Presence of Outliers and/or Excess Zero
【24h】

Models for Count Data in the Presence of Outliers and/or Excess Zero

机译:存在异常值和/或多余零的计数数据模型

获取原文
           

摘要

Violations of Poisson assumptions usually result in overdispersion, where the variance of the model exceeds the value of the mean. Excess or (deficiency) of zero counts result in overdispersion. Violations of equidispersion indicate correlation in the data, which affect standard errors of the parameter estimates. Model fit is also affected. (Hilbe 2008). Therefore, this study examined the impact of outliers and excess zero on count data in causing overdispersion. The study focus on identifying model(s) which can handle the impact of outliers and excess zero in count data. Datasets based on Poisson model were simulated for sample sizes 20, 50 and 100 and incorporated with outliers and excess zero. Maximum likelihood estimation method was employed in estimating the parameters. Model selection is based on dispersion index, AIC, BIC and log likelihood statistics, putting into consideration Poisson, Negative Binomial, Zero Inflated Poisson and Zero Inflated Negative Binomial models and results obtained indicates that ZINB is the best models for analyzing count data in the presence of outliers and/or excess zero. Keywords: Count data, Overdispersion, Excess zero, outliers, Goodness of fit, Poisson, Negative Binomial and Zero inflated models
机译:违反泊松假设通常会导致过度分散,其中模型的方差超过均值。零计数的过多或不足会导致分散。违反等散表明数据中存在相关性,这会影响参数估计的标准误差。模型拟合也会受到影响。 (Hilbe 2008)。因此,本研究考察了异常值和过量零对计数数据造成过度分散的影响。该研究的重点是确定可以处理异常值和计数数据中过多零的影响的模型。对基于Poisson模型的数据集进行了模拟,其样本量分别为20、50和100,并包含异常值和过量零。在参数估计中采用了最大似然估计方法。模型选择基于色散指数,AIC,BIC和对数似然统计,并考虑了泊松,负二项式,零膨胀泊松和零膨胀负二项式模型,并且获得的结果表明ZINB是存在下分析计数数据的最佳模型离群值和/或多余的零。关键字:计数数据,过度分散,过零,离群值,拟合优度,泊松,负二项式和零膨胀模型

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号