A minera??o de dados e a qualidade de conhecimentos extraídos dos boletins de ocorrência das rodovias federais brasileiras

Jefferson de Jesus Costa; Flávia Cristina Bernardini; José Viterbo Filho

首页> 外文期刊>AtoZ : Novas Práticas em Informa??o e Conhecimento >A minera??o de dados e a qualidade de conhecimentos extraídos dos boletins de ocorrência das rodovias federais brasileiras

【24h】

A minera??o de dados e a qualidade de conhecimentos extraídos dos boletins de ocorrência das rodovias federais brasileiras

机译：从巴西联邦高速公路警察报告中提取的数据挖掘和知识质量

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Introduction: This paper presents and analyzes the results obtained when applying Data Mining process in the bulletins of occurrences of the Brazilian federal highways generated by the Federal Highway Police (PRF) in 2012. The purpose of this work is to analyze the feasibility of implementing the Data Mining process on data provided by PRF in order to identify associations between variables related to transit accidents in all Brazilian federal highways. Method: It was used symbolic supervised learning algorithms, as well as an algorithm of generation of association rules, implemented in Weka tool. Regarding the database, it was used the records of 2012. On this portion of the database it was conducted the step of data preprocessing, which were used for extracting models and patterns in the Weka tool and, lastly, evaluated the models and extracted patterns. Results: In supervised learning, the results obtained with J48 and PART algorithms have been considered promising due to the fact that for all classes of accidents causes, the values of area under the ROC curve (AUC) were above 0.5. Furthermore, using the Apriori algorithm there have been generated 38 association rules with confidence greater than 0.8. Conclusions: It was concluded that is important to propose a model for data distribution of this database, in order to use it for data mining process, as well as other knowledge extraction tasks and decision making. It was noted still, the need to improve the quality of data to be provided from the initial stage of data gathering, that is, in the very systems used to record the data.

机译：简介：本文介绍并分析了在2012年联邦高速公路警察（PRF）生成的巴西联邦高速公路发生情况公告中应用数据挖掘过程时获得的结果。该工作的目的是分析实施该方法的可行性。对PRF提供的数据进行数据挖掘过程，以识别与巴西所有联邦公路中的交通事故相关的变量之间的关联。方法：使用符号监督学习算法以及在Weka工具中实现的关联规则生成算法。关于数据库，使用了2012年的记录。在数据库的这一部分上，进行了数据预处理的步骤，该步骤用于在Weka工具中提取模型和模式，最后评估模型和提取的模式。结果：在监督学习中，由于对于所有类别的事故原因，ROC曲线下的面积值（AUC）均大于0.5，因此使用J48和PART算法获得的结果被认为很有希望。此外，使用Apriori算法，已经生成了38个置信度大于0.8的关联规则。结论：结论是重要的是，为该数据库的数据分布提出一个模型，以便将其用于数据挖掘过程以及其他知识提取任务和决策。仍然注意到，需要提高从数据收集的初始阶段即在用于记录数据的系统中要提供的数据的质量。

著录项

来源
《AtoZ : Novas Práticas em Informa??o e Conhecimento》 |2014年第2期|共19页
作者
Jefferson de Jesus Costa; Flávia Cristina Bernardini; José Viterbo Filho;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类新闻学、新闻事业;
关键词

相似文献

外文文献
中文文献
专利

1. EFICIêNCIA NAS UNIVERSIDADES FEDERAIS BRASILEIRA: UMA APLICA??O DA ANáLISE ENVOLTóRIA DE DADOS [J] . Maria de los Angeles Martinez Cohen, Adriano Nascimento Paix?o, Nilton Marques Oliveira Revista GEPEC . 2018,第1期

机译：巴西联邦大学的效率：数据参与分析的应用
2. Uso de análise envoltória de dados para mensurar eficiência temporal de rodovias federais concessionadas [J] . Guilherme Henrique Ismael De Azevedo, Jo?o Carlos Correia Baptista Soares De Mello, Juliana Quintanilha Da Silveira, Journal of Transport Literature . 2012,第1期

机译：使用数据包络分析来衡量联邦特许权道路的时间效率
3. EXTRA??O DE CONHECIMENTO ATRAVéS DA MINERA??O DE DADOS [J] . Revista de Engenharia e Tecnologia . 2010,第2期

机译：通过数据挖掘提取知识
4. IMPACTO DA QUALIDADE DOS DADOS NO PLANEJAMENTO DE CURTO PRAZO [C] . Cristina da Paixao Araujo, Marcel Antonio Arcari Bassani, Joao Felipe Coimbra Leite Costa ABM International Congress . 2014

机译：短期规划中数据质量的影响
5. O princípio da eficiência em contabilidade pública: a aloca??o de recursos públicos para a gera??o de educa??o e saúde nos estados brasileiros. [D] . Soares, Marilene Feitosa. 2019

机译：公共会计效率原则：在巴西国家在教育和健康中分配公共资源。
6. Vigilância epidemiológica da transmissão vertical da sífilis: dados de seis unidades federativas no Brasil [O] . Valeria Saraceni, Gerson Fernando Mendes Pereira, Mariangela Freitas da Silveira, 2017

机译：梅毒垂直传播的流行病学监测：来自巴西六个联邦单位的数据
7. Extração da informação e produção de conhecimento por meio da mineração de dados [O] . Nadi Helena Presser, Eli Lopes da Silva 2018

机译：通过数据挖掘提取信息和知识生产

A minera??o de dados e a qualidade de conhecimentos extraídos dos boletins de ocorrência das rodovias federais brasileiras

摘要

著录项

相似文献

相关主题

期刊订阅