首页> 外文学位 >Traffic Crash Modeling Considering Inconsistent Observations, Interaction Behavior, and Nonlinear Relationships.
【24h】

Traffic Crash Modeling Considering Inconsistent Observations, Interaction Behavior, and Nonlinear Relationships.

机译:考虑到不一致的观察,交互行为和非线性关系的交通事故建模。

获取原文
获取原文并翻译 | 示例

摘要

Traffic collisions are a worldwide issue that can cause injury and death, which leads to billions of dollars in damages every year. Significant research efforts have been undertaken to develop and utilize statistical modeling techniques for analyzing the characteristics of crash count data. While these modeling techniques have been providing meaningful outputs, improvements on these modeling methods still need to better understand the crash risk and the contributing factors. Five important issues in crash data modeling are identified in this research. The first two issues are over or under dispersion with crash data and excess zeros within crash records. Considering that they have been well studied in the previous research, this study focuses on the remaining three major issues. The first one is relevant to the partial observations of multiple processes, i.e. crash data may be collected by different agencies that create multiple data sources and may be inconsistent. A modeling mechanism that takes advantage of all datasets for better estimation results is highly desirable. The second one is an interaction issue. Some collisions are single vehicle crashes, such as off-road crashes and rollover incidents, and some collisions involve interaction behavior, such as the Animal-Vehicle Collision (AVC) and the Vehicle-Vehicle Collision. The characteristics of crashes with interaction behavior are different from those with only one vehicle involved. It is challenging to develop a crash modeling scheme that can capture the interaction behavior. The last one is the nonlinear relationship issue. Most previous collision models are Generalized Linear Model-based (GLM-based) approaches. Such GLM-based approaches are constrained by their linear model specifications because, in most situations, the relationship between the crash rate and its contributing factors are not linear or may not even be monotonic. Thus, finding a way to model the collision data with nonlinear and non-monotonic relationships is of utmost importance.;To address the issues of inconsistent observations, two techniques are developed. A fuzzy logic-based data mapping algorithm is proposed as the first technique to match data from two datasets so that duplicate crash records can be removed when combining these datasets. The membership functions of the fuzzy logic algorithm are established based on survey inputs collected from experts of the Washington State Department of Transportation (WSDOT). As verified by expert judgment collected through another survey, the accuracy of this algorithm was approximately 90%. Applying this algorithm to the two WSDOT datasets relevant to AVC, reported AVC data and the Carcass Removal (CR) data, the combined dataset has 15% –22% more records compared to the original CR dataset. The proposed algorithm is proven effective for merging the Reported AVC data and the CR data, with a combined dataset being more complete for wildlife safety studies and countermeasure evaluations.;The second technique is a diagonal inflated bivariate Poisson regression (DIBP) method. It is an inflated version of bivariate Poisson regression model adopted to directly fit two datasets together. The proposed model technique was also applied to the reported AVC and CR data sets collected in Washington State between 2002 and 2006. The diagonal inflated bivariate Poisson model not only can model paired data with correlation, but also handle under- or over- dispersed data sets. Compared with three other types of models; double Poisson, bivariate Poisson, and zero-inflated double Poisson; the diagonal inflated bivariate Poisson model demonstrates its capability of fitting two datasets with remarkable overlapping portions resulting from the same stochastic process. Therefore, the diagonal inflated bivariate Poisson model provides researchers another new approach to investigating paired data sources from a different perspective.;To address the issues with the interaction issue, a new occurrence mechanism-based probability model, an interaction-based model, which explicitly formulates the interactions between the objects, is introduced. The proposed method was applied to the AVC data and this method can explicitly formulate the interactions between animals and drivers to better capture the relationships among drivers' and animals' attributes, roadway and environmental factors, and AVCs. Findings of this study show that the proposed occurrence mechanism-based probability model better capture the impact of drivers' and animals' attributes on the AVC. This method can be further developed to model other types of collisions with interaction behavior.;To address the nonlinear relationship issue, a Generalized Nonlinear Model (GNM)-based approach is put forward. The GNM-based approach is developed to utilize a nonlinear regression function to better elaborate non-monotonic relationships between the independent and dependent variables. Previous studies focused mainly on causal factor identification and crash risk modeling using Generalized Linear Models (GLMs), such as Poisson regression, and logistic regression among others. However, their basic assumption of a generalized linear relationship between the dependent variable (for example, crash rate) and independent variables (for example, contributing factors to crashes) established via a link function can often be violated in reality. Consequently, the GLM-based modeling results could provide biased findings and conclusions when the contributing factors have parabolic impact on the crashes. In this research, a GNM-based approach is applied with the rear end accident data and the AVC data collected from ten highway routes starting in 2002 and ending in 2006. For the rear-end collision application, the results show that truck percentage and grade have a parabolic impact: both items increase crash risks initially, but decrease risks after certain thresholds. Similarly, Annual Average Daily Traffic (AADT) and grade also have a parabolic impact on the AVC rate. Such non-monotonic relationships cannot be captured by regular GLM's, which further demonstrates the flexibility of GNM-based approaches in modeling the nonlinear relationship among data and providing more reasonable explanations. The superior GNM-based model interpretations better explain the parabolic impacts of some specific contributing factors and help in selecting and evaluating rear-end crash safety improvement plans.;In Summary, these solutions proposed to address the three major issues in crash modeling are important for crash studies. The fuzzy-logic based data mapping algorithm can combine partial observations from different processes to form up a more complete dataset for a thorough analysis. The diagonal inflated bivariate Poisson models can directly take two data observation processes into account. The occurrence mechanism based probability models and GNM based models are effective methods for handling the interaction issue and non-linear relationships between dependent and independent variables.
机译:交通事故是一个世界性的问题,可能导致人员伤亡,每年导致数十亿美元的损失。为了开发和利用统计建模技术来分析事故计数数据的特性,已经进行了大量的研究工作。尽管这些建模技术已提供了有意义的输出,但仍需要对这些建模方法进行改进,以更好地了解碰撞风险和影响因素。这项研究确定了碰撞数据建模中的五个重要问题。前两个问题是由于崩溃数据以及崩溃记录中的过多零而造成的。考虑到在先前的研究中对它们​​进行了充分的研究,所以本研究着重于其余三个主要问题。第一个与对多个过程的部分观察有关,即崩溃数据可能由创建多个数据源的不同机构收集,并且可能不一致。迫切需要一种利用所有数据集以获得更好估计结果的建模机制。第二个是互动问题。一些碰撞是单车碰撞,例如越野碰撞和侧翻事故,而某些碰撞则涉及交互行为,例如动物-车辆碰撞(AVC)和车辆-车辆碰撞。具有交互行为的碰撞特征不同于仅涉及一辆车辆的碰撞特征。开发可捕获交互行为的崩溃建模方案是一项挑战。最后一个是非线性关系问题。以前的大多数碰撞模型都是基于广义线性模型(基于GLM)的方法。这种基于GLM的方法受其线性模型规范的约束,因为在大多数情况下,崩溃率与其影响因素之间的关系不是线性的,甚至可能不是单调的。因此,找到一种建模具有非线性和非单调关系的碰撞数据的方法至关重要。为了解决观测不一致的问题,开发了两种技术。提出了一种基于模糊逻辑的数据映射算法作为匹配来自两个数据集的数据的第一种技术,以便在合并这些数据集时可以删除重复的崩溃记录。模糊逻辑算法的隶属函数是根据从华盛顿州交通运输部(WSDOT)专家那里收集的调查输入建立的。正如通过另一项调查收集的专家判断所证实的那样,该算法的准确性约为90%。将该算法应用于与AVC相关的两个WSDOT数据集,报告的AVC数据和屠体去除(CR)数据,与原始CR数据集相比,合并后的数据集的记录增加了15%–22%。该算法被证明对合并报告的AVC数据和CR数据是有效的,合并后的数据集对于野生动植物安全性研究和对策评估更为完整。第二种技术是对角膨胀双变量Poisson回归(DIBP)方法。它是双变量Poisson回归模型的放大版本,用于直接将两个数据集拟合在一起。所提出的模型技术还应用于在2002年至2006年间在华盛顿州收集的已报告的AVC和CR数据集。对角膨胀双变量Poisson模型不仅可以对具有相关性的配对数据进行建模,还可以处理分散度过低或过高的数据集。与其他三种类型的模型相比;双泊松,双变量泊松和零膨胀双泊松;对角膨胀双变量泊松模型证明了其能够拟合两个数据集的能力,这些数据集具有相同的随机过程所产生的明显重叠部分。因此,对角膨胀双变量Poisson模型为研究人员提供了另一种从不同角度研究配对数据源的新方法。为了解决与交互问题有关的问题,新的基于发生机制的概率模型,基于交互的模型明确阐述对象之间的相互作用,进行介绍。将该方法应用于AVC数据,该方法可以明确地公式化动物与驾驶员之间的相互作用,以更好地捕捉驾驶员与动物属性,道路和环境因素以及AVC之间的关系。这项研究的结果表明,所提出的基于发生机制的概率模型可以更好地捕获驾驶员和动物属性对AVC的影响。可以进一步开发该方法,以模拟具有交互行为的其他类型的碰撞。;解决非线性关系问题提出了一种基于广义非线性模型的方法。开发了基于GNM的方法,以利用非线性回归函数更好地阐述自变量和因变量之间的非单调关系。先前的研究主要集中在使用广义线性模型(GLM)进行因果关系识别和碰撞风险建模,例如Poisson回归和Logistic回归。但是,实际上常常会违反通过链接函数建立的因变量(例如,崩溃率)和自变量(例如,导致崩溃的因素)之间的广义线性关系的基本假设。因此,当影响因素对事故造成抛物线影响时,基于GLM的建模结果可能会提供有偏差的发现和结论。在这项研究中,基于GNM的方法应用于从2002年开始至2006年结束的10条高速公路路线的后端事故数据和AVC数据。对于后端碰撞应用,结果显示卡车的百分比和坡度产生抛物线影响:两个项目最初都会增加碰撞风险,但在达到一定阈值后会降低碰撞风险。同样,年平均每日流量(AADT)和等级也会对AVC率产生抛物线影响。常规的GLM无法捕获这种非单调的关系,这进一步证明了基于GNM的方法在对数据之间的非线性关系进行建模并提供更合理的解释方面的灵活性。基于GNM的出色模型解释可以更好地解释某些特定影响因素的抛物线影响,并有助于选择和评估后端碰撞安全改进计划。总之,为解决碰撞建模中的三个主要问题而提出的这些解决方案对于碰撞研究。基于模糊逻辑的数据映射算法可以结合来自不同过程的部分观测结果,以形成更完整的数据集以进行全面的分析。对角膨胀双变量泊松模型可以直接考虑两个数据观测过程。基于发生机制的概率模型和基于GNM的模型是处理交互问题以及因变量和自变量之间的非线性关系的有效方法。

著录项

  • 作者

    Lao, Yunteng.;

  • 作者单位

    University of Washington.;

  • 授予单位 University of Washington.;
  • 学科 Engineering Civil.;Transportation.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 193 p.
  • 总页数 193
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:41:36

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号