首页> 外文期刊>International Journal of Public Information Systems >Knowledge Discovery in Road Accidents Database - Integration of Visual and Automatic Data Mining Methods
【24h】

Knowledge Discovery in Road Accidents Database - Integration of Visual and Automatic Data Mining Methods

机译:道路事故数据库中的知识发现-可视和自动数据挖掘方法的集成

获取原文
       

摘要

Road accident statistics are collected and used by a large number of users and this can result in a huge volume of data which requires to be explored in order to ascertain the hidden knowledge. Potential knowledge may be hidden because of the accumulation of data, which limits the exploration task for the road safety expert and, hence, reduces the utilization of the database. In order to assist in solving these problems, this paper explores Automatic and Visual Data Mining (VDM) methods. The main purpose is to study VDM methods and their applicability to knowledge discovery in a road accident databases. The basic feature of VDM is to involve the user in the exploration process. VDM uses direct interactive methods to allow the user to obtain an insight into and recognize different patterns in the dataset. In this paper, I apply a range of methods and techniques, including a paradigm for VDM, exploratory data analysis, and clustering methods, such as K-means algorithms, hierarchical agglomerative clustering (HAC), classification trees, and self-organized-maps (SOM). These methods assist in integrating VDM with automatic data mining algorithms. Open source VDM tools offering visualization techniques were used. The first contribution of this paper lies in the area of discovering clusters and different relationships (such as the relationship between socioeconomic indicators and fatalities, traffic risk and population, personal risk and car per capita, etc.) in the road safety database. The methods used were very useful and valuable for detecting clusters of countries that share similar traffic situations. The second contribution was the exploratory data analysis where the user can explore the contents and the structure of the data set at an early stage of the analysis. This is supported by the filtering components of VDM. This assists expert users with a strong background in traffic safety analysis to be able to intimate assumptions and hypotheses concerning future situations. The third contribution involved interactive explorations based on brushing and linking methods; this novel approach assists both the experienced and inexperienced users to detect and recognize interesting patterns in the available database. The results obtained showed that this approach offers a better understanding of the contents of road safety databases, with respect to current statistical techniques and approaches used for analyzing road safety situations.
机译:道路事故统计数据被大量用户收集和使用,这可能会导致需要探查大量数据以确定隐藏的知识。由于数据的积累,可能隐藏了潜在的知识,这限制了道路安全专家的探索任务,因此减少了数据库的利用率。为了帮助解决这些问题,本文探索了自动和可视数据挖掘(VDM)方法。主要目的是研究VDM方法及其在道路事故数据库中发现知识的适用性。 VDM的基本功能是使用户参与探索过程。 VDM使用直接的交互方法,使用户可以洞察并识别数据集中的不同模式。在本文中,我应用了多种方法和技术,包括VDM范例,探索性数据分析和聚类方法,例如K-means算法,分层聚类聚类(HAC),分类树和自组织图(SOM)。这些方法有助于将VDM与自动数据挖掘算法集成在一起。使用了提供可视化技术的开源VDM工具。本文的第一点贡献在于在道路安全数据库中发现集群和不同关系(例如社会经济指标与死亡人数,交通风险与人口,人身风险与人均汽车等之间的关系)。所使用的方法对于检测交通状况相似的国家集群非常有用且有价值。第二个贡献是探索性数据分析,用户可以在分析的早期阶段探索数据集的内容和结构。 VDM的筛选组件支持此功能。这有助于在交通安全分析方面具有深厚背景的专家用户能够得出有关未来情况的假设和假设。第三项贡献涉及基于刷和链接方法的交互式探索;这种新颖的方法可以帮助有经验的用户和没有经验的用户在可用数据库中检测和识别有趣的模式。获得的结果表明,相对于用于分析道路安全状况的当前统计技术和方法,该方法可以更好地理解道路安全数据库的内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号