首页> 外文会议>International Conference on Logistics, Informatics and Service Sciences >Short Text Mining for Fault Diagnosis of Railway System Based on Multi-Granularity Topic Model
【24h】

Short Text Mining for Fault Diagnosis of Railway System Based on Multi-Granularity Topic Model

机译:基于多粒度主题模型的铁路系统故障诊断短文本挖掘

获取原文

摘要

Vehicle equipment is one of the core equipments of train control system in the high-speed railway, and it mainly depends on the experience of maintenance personnel to perform fault diagnosis when it fails, which is ineffective. The maintenance text data of vehicle equipment, which contains fault category information, is not fully utilized. The classification of maintenance text can well assist fault diagnosis of vehicle equipment. However, shortness and imbalanced class distribution of maintenance texts, which hinder the application of conventional text representation models and classification algorithms, pose challenges for classification task. In this paper, we propose a novel text feature selection algorithm based on multi-granularity latent Dirichlet allocation (LDA) to overcome shortness characteristic of maintenance text. To solve the problem of class imbalance, a cost-sensitive Support Vector Machine (SVM) is utilized to construct fault diagnosis model. Finally, we compare our proposed method with the state-of-the-art baseline over a vehicle maintenance text data set collected by Guangzhou railway corporation, and it outperforms traditional methods.
机译:车辆设备是高速铁路列车控制系统的核心设备之一,故障时主要依靠维修人员的经验进行故障诊断,效果不佳。包含故障类别信息的车辆设备的维护文本数据未被充分利用。维护文本的分类可以很好地辅助车辆设备的故障诊断。然而,维护文本的简短和不均衡的类别分布阻碍了常规文本表示模型和分类算法的应用,给分类任务带来了挑战。在本文中,我们提出了一种基于多粒度潜在狄利克雷分配(LDA)的文本特征选择算法,以克服维护文本的短性特征。为了解决类不平衡的问题,利用成本敏感的支持向量机(SVM)构建故障诊断模型。最后,我们在广州铁路公司收集的车辆维修文本数据集上,将我们提出的方法与最新的基线进行了比较,其性能优于传统方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号