首页> 外文会议>International conference on emerging trends in information technology >Mining Frequent Patterns with Temporal Effect: A Case of Accident Path Analysis
【24h】

Mining Frequent Patterns with Temporal Effect: A Case of Accident Path Analysis

机译:采矿频繁模式具有时间效应:事故路径分析的情况

获取原文

摘要

Frequent item-set generation is usually applied on large databases to mine item-sets recurring together. The objective of such analysis is to identify patterns in dataset which are of managerial interests. In cases where the dataset is spread over significant period of time, the results can sometime be misleading. The reasons are twofold: (ⅰ) pattern existing with high frequency at certain time period and later experiencing decreasing trend will appear in result and (ⅱ) pattern present in recent time periods with increasing trend but not with significantly high frequency will not appear in result. Due to this the results become bias to time effect. In the proposed work, we have included the time effect in data during pattern mining by exponentially weighing the dataset i.e., the more recent the data, higher the weight assigned to it. The method is applied to accident path components from incident records of an integrated steel plant. The results from the proposed work and traditional frequent item-set mining are compared and results are inferred as: (ⅰ) the patterns present only in traditional method's result are the accident patterns which are under control and have effective risk control system (RCS), (ⅱ) patterns present in both the results are paths where either RCS is absent or ineffective, and (ⅲ) the patterns present only in proposed method's result will have higher probability to recur in future. The proposed method provides new direction in interpretation of pattern mining from large dataset of long period.
机译:频繁项集的生成通常适用于大型数据库矿项目集重复在一起。这种分析的目的是确定在数据集的模式是其中的管理利益。在数据集被分布在时间显著周期的情况下,其结果可能有时会产生误导。的原因是双重的:(ⅰ)与在一定的时间段的高频图案现有后来经历减少的趋势将出现在结果和图案存在于具有增加的趋势最近时间段(ⅱ),但不与显著高频不会出现在结果。由于这种结果变成偏置时间效应。在建议的工作,我们已在数据模式挖掘过程中成倍称重数据集,即越近的数据,分配给它的权重越高的时间效应。该方法是从一个综合性钢铁厂的事件记录施加到事故路径组件。从建议的工作和传统的频繁项集挖掘的结果进行比较,并作为结果推断:(ⅰ)模式目前只在传统方法的结果是意外的模式,其是在控制之下,并有有效的风险控制系统(RCS), (ⅱ)存在于两个图案的结果是路径其中或者RCS为不存在或无效,和(ⅲ)的图案呈现仅在提出的方法的结果将有较高的概率在未来复发。所提出的方法提供了在从大的数据集长周期的模式挖掘的解释新的方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号