首页> 外文会议>International Conference on Transportation and Development >Identifying Themes in Railroad Equipment Accidents Using Text Mining and Text Visualization
【24h】

Identifying Themes in Railroad Equipment Accidents Using Text Mining and Text Visualization

机译:使用文本挖掘和文本可视化识别铁路设备事故中的主题

获取原文

摘要

Developments in text mining now allow useful information to be automatically extracted from text. The Federal Railroad Administration (FRA) publishes a database of railroad equipment accidents. These accident records contain numeric data describing the accident and a text description of the accident. This paper will discuss how Latent Dirichlet Analysis (LDA), a text-mining algorithm, can be used to identify major recurring accident topics from the text in the FRA reports. Equipment accident reports from 2005 to 2015 were studied. This analysis identified railroad grade crossing accidents with large trucks, shoving accidents, and hump yard accidents as major topics in the accident reports. An alternative method of analyzing the text, text clustering, was also used to study the FRA data. Visualizations of the text also provide useful information about the major types of railroad accidents.
机译:文本挖掘的发展现在允许从文本中自动提取有用的信息。联邦铁路管理局(FRA)发布了铁路设备事故数据库。这些事故记录包含描述事故的数字数据和事故的文本描述。本文将讨论潜在的Dirichlet分析(LDA)(一种文本挖掘算法)如何用于从FRA报告中的文本中识别主要的重复性事故主题。研究了2005年至2015年的设备事故报告。该分析将大型卡车的铁路平交道口事故,铲车事故和驼峰场事故确定为事故报告中的主要主题。分析文本的另一种方法,即文本聚类,也用于研究FRA数据。文本的可视化还提供了有关主要铁路事故类型的有用信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号