首页> 外文学位 >Investigating the effects of sample size, model misspecification and underreporting in crash data on three commonly used traffic crash severity models.
【24h】

Investigating the effects of sample size, model misspecification and underreporting in crash data on three commonly used traffic crash severity models.

机译:在三个常用的交通事故严重性模型上,研究样本大小,模型规格不正确以及崩溃数据漏报的影响。

获取原文
获取原文并翻译 | 示例

摘要

Numerous studies have documented the application of crash severity models to explore the relationship between crash severity and its contributing factors. These studies have shown that a large amount of work was conducted on this topic and usually focused on different types of models. However, only a limited amount of research has compared the performance of different crash severity models. Additionally, three major issues related to the modeling process for crash severity analysis have not been sufficiently explored: sample size, model misspecification and underreporting in crash data. Therefore, in this research, three commonly used traffic crash severity models: multinomial logit model (MNL), ordered probit model (OP) and mixed logit model (ML) were studied in terms of the effects of sample size, model misspecification and underreporting in crash data, via a Monte-Carlo approach using simulated and observed crash data.;The results of sample size effects on the three models are consistent with prior expectations in that small sample sizes significantly affect the development of crash severity models, no matter which model type is used. Furthermore, among the three models, the ML model was found to require the largest sample size, while the OP model required the lowest sample size. In addition, when the sample size is sufficient, the results of model misspecification analysis lead to the following suggestions: in order to decrease the bias and variability of estimated parameters, logit models should be selected over probit models. Meanwhile, it was suggested to select more general and flexible model such as those allowing randomness in the parameters, i.e., the ML model. Another important finding was that the analysis of the underreported data for the three models showed that none of the three models was immune to this underreporting issue. However, setting data properly could minimize the bias and variability. Furthermore, when the full or partial information about the unreported rates for each severity level is known, treating crash data as outcome-based samples in model estimation, via the Weighted Exogenous Sample Maximum Likelihood Estimator (WESMLE), dramatically improve the estimation for all three models.
机译:许多研究已经记录了碰撞严重性模型的应用,以探索碰撞严重性及其影响因素之间的关系。这些研究表明,针对此主题进行了大量工作,并且通常侧重于不同类型的模型。但是,只有有限的研究比较了不同碰撞严重性模型的性能。此外,与碰撞严重性分析的建模过程相关的三个主要问题尚未得到充分研究:样本大小,模型错误指定和碰撞数据漏报。因此,在本研究中,针对样本量,模型错误指定和报告不足的影响,研究了三种常用的交通事故严重性模型:多项式logit模型(MNL),有序概率模型(OP)和混合logit模型(ML)。通过使用模拟和观察到的碰撞数据的蒙特卡洛方法,获得碰撞数据;三个模型的样本量影响结果与先前的预期一致,因为无论哪种模型,小样本量都会显着影响碰撞严重性模型的开发类型被使用。此外,在这三个模型中,发现ML模型需要最大的样本量,而OP模型需要的最小样本量。另外,当样本量足够大时,模型错误指定分析的结果会导致以下建议:为了减少估计参数的偏差和可变性,应该选择Logit模型而不是概率模型。同时,建议选择更通用和灵活的模型,例如那些允许参数随机的模型,即ML模型。另一个重要发现是,对这三个模型的报告不足数据的分析表明,这三个模型都无法幸免。但是,正确设置数据可以使偏差和可变性最小化。此外,当了解有关每个严重性级别的未报告比率的全部或部分信息时,通过加权外生样本最大似然估计器(WESMLE)将崩溃数据视为模型估计中基于结果的样本,可以显着改善这三个方面的估计楷模。

著录项

  • 作者

    Ye, Fan.;

  • 作者单位

    Texas A&M University.;

  • 授予单位 Texas A&M University.;
  • 学科 Statistics.;Engineering Civil.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 169 p.
  • 总页数 169
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:45:13

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号