首页> 外文会议>OnTheMove International Federated Conference >Towards Event Log Querying for Data Quality Let's Start with Detecting Log Imperfections
【24h】

Towards Event Log Querying for Data Quality Let's Start with Detecting Log Imperfections

机译:朝着事件日志查询数据质量,让我们开始检测日志缺陷

获取原文
获取外文期刊封面目录资料

摘要

Process mining is, by now, a well-established discipline focussing on process-oriented data analysis. As with other forms of data analysis, the quality and reliability of insights derived through analysis is directly related to the quality of the input (garbage in - garbage out). In the case of process mining, the input is an event log comprised of event data captured (in information systems) during the execution of the process. It is crucial then that the event log be treated as a first-class citizen. While data quality is an easily understood concept little effort has been directed towards systematically detecting data quality issues in event logs. Analysts still spend a large proportion of any project in 'data cleaning', often involving manual and ad hoc tasks, and requiring more than one tool. While there are existing tools and languages that query event logs, the problem of different approaches for different log imperfections remains. In this paper we take the first steps to developing QUELI (Querying Event Log for Imperfections) a log query language that provides direct support for detecting log imperfections. We develop an approach that identifies capabilities required of QUELI and illustrate the approach by applying it to 5 of the 11 event log imperfection patterns described in [29]. We view this as a first step towards operationalising systematic, automated support for log cleaning.
机译:流程挖掘是现在,一个既熟悉的学科,重点是面向过程的数据分析。与其他形式的数据分析一样,通过分析的洞察力的质量和可靠性与输入的质量直接相关(垃圾垃圾垃圾)。在处理挖掘的情况下,输入是在执行过程中捕获(在信息系统中)的事件数据组成的事件日志。这是至关重要的,那么事件日志被视为一流的公民。虽然数据质量是一种易于理解的概念,但朝着在事件日志中系统地检测数据质量问题的努力很少。分析师仍花费大量的任何项目在“数据清洁”中,通常涉及手动和临时任务,并要求多个工具。虽然存在查询事件日志的现有工具和语言,但仍然存在不同日志缺陷的不同方法的问题。在本文中,我们采取了开发Queli(查询事件日志以获取缺陷)的第一个步骤,该语言可以直接支持检测日志缺陷。我们开发一种方法,该方法识别Queli所需的能力,并通过将[29]中描述的11个事件日志缺陷模式中的5个应用程序应用于5。我们将此视为迈向运营系统,自动支持对日志清洁的第一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号