【24h】

Truncated Trace Classifier. Removal of Incomplete Traces from Event Logs

机译:截断的跟踪分类器。从事件日志中删除不完整的跟踪

获取原文

摘要

We consider truncated traces, which are incomplete sequences of events. This typically happens when dealing with streaming data or when the event log extraction process cuts the end of the trace. The existence of truncated traces in event logs and their negative impacts on process mining outcomes have been widely acknowledged in the literature. Still, there is a lack of research on algorithms to detect them. We propose the Truncated Trace Classifier (TTC), an algorithm that distinguishes truncated traces from the ones that are not truncated. We benchmark 5 TTC implementations that use either LSTM or XGBOOST on 13 real-life event logs. Accurate TTCs have great potential. In fact, filtering truncated traces before applying a process discovery algorithm greatly improves the precision of the discovered process models, by 9.1%. Moreover, we show that TTCs increase the accuracy of a next event prediction algorithm by up to 7.5%.
机译:我们考虑截断的痕迹,这是事件的不完整序列。当处理流数据或事件日志提取过程中断跟踪的末尾时,通常会发生这种情况。在事件日志中截断痕迹的存在及其对过程挖掘结果的负面影响已在文献中得到广泛认可。但是,仍然缺乏有关检测算法的研究。我们提出了截断的轨迹分类器(TTC),一种将截断的轨迹与未截断的轨迹区分开的算法。我们对13个真实事件日志中使用LSTM或XGBOOST的5个TTC实现进行了基准测试。准确的TTC潜力巨大。实际上,在应用过程发现算法之前过滤截断的迹线可以极大地提高发现的过程模型的精度,达到9.1%。此外,我们表明TTC将下一事件预测算法的准确性提高了7.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号