首页> 外文会议>International conference on emerging trends in information technology >Root Cause Analysis of Incidents Using Text Clustering and Classification Algorithms
【24h】

Root Cause Analysis of Incidents Using Text Clustering and Classification Algorithms

机译:使用文本聚类和分类算法引起事件的根本原因分析

获取原文

摘要

The purpose of this study is to cluster the injury narratives to extract the root causes behind the accidents. Analysis is done on incident data collected from the database of an integrated steel plant. Key terms generated from the clustering of incident scenario help us in finding root causes of that particular incident. This study also proposed specific measures to the management that would improve the safety performance. This study uses text document clustering to discover the hidden factors and causes behind the incidents. Understanding previous accidents is necessary to avoid future accidents. However, for companies, management of large accident databases, and accurately classifying accident narratives are very challenging issues. Therefore, the aim of this study is to accurately classify accident reports using text classification approaches and evaluate their usefulness. The study used two machine learning (ML) algorithms, namely random forest (R.F), and support vector machine (SVM) and found that SVM performed best in classifying the accident narratives. Further, SVM was experimented with different tokenization of the preprocessed narratives to get more reliable results.
机译:本研究的目的是聚集伤害叙述以提取事故背后的根本原因。分析是从集成钢铁厂的数据库收集的入射数据完成的。从事件方案的聚类产生的关键术语帮助我们找到该特定事件的根本原因。本研究还提出了对可提高安全绩效的管理的具体措施。本研究使用文本文档聚类来发现隐藏的因素并导致事件后面。理解以前的事故是必要的,以避免未来的事故。然而,对于公司的公司,大型事故数据库的管理,以及准确分类事故叙述是非常具有挑战性的问题。因此,本研究的目的是使用文本分类方法准确地分类事故报告并评估其有用性。该研究使用了两种机器学习(ML)算法,即随机森林(R.F),并支持向量机(SVM),发现SVM在分类事故叙述中最为表现。此外,SVM进行了针对预处理叙述的不同标记,以获得更可靠的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号