首页> 外文会议>International conference on big data analytics >Khanan: Performance Comparison and Programming α-Miner Algorithm in Column-Oriented and Relational Database Query Languages
【24h】

Khanan: Performance Comparison and Programming α-Miner Algorithm in Column-Oriented and Relational Database Query Languages

机译:Khanan:面向列和关系数据库查询语言中的性能比较和编程α - 矿物算法

获取原文

摘要

Process-Aware Information Systems (PAIS) support business processes and generate large amounts of event logs from the execution of business processes. An event log is represented as a tuple of CaselD, Timestamp, Activity and Actor. Process Mining is a new and emerging field that aims at analyzing the event logs to discover, enhance and improve business processes and check conformance between run time and design time business processes. The large volume of event logs generated are stored in the databases. Relational databases perform well for a certain class of applications. However, there is a certain class of applications for which relational databases are not able to scale well. To address the challenges of scalability, NoSQL database systems emerged. Discovering a process model (workflow) from event logs is one of the most challenging and important Process Mining tasks. The α-miner algorithm is one of the first and most widely used Process Discovery techniques. Our objective is to investigate which of the databases (Relational or NoSQL) performs better for a Process Discovery application under Process Mining. We implement the α-miner algorithm on relational (row-oriented) and NoSQL (column-oriented) databases in database query languages so that our application is tightly coupled to the database. We conduct a performance benchmarking and comparison of the α-miner algorithm on row-oriented database and NoSQL column-oriented database. We present the comparison on various aspects like time taken to load large datasets, disk usage, stepwise execution time and compression technique.
机译:处理感知信息系统(PAI)支持业务流程并从执行业务流程中生成大量事件日志。事件日志表示为Caseld,Timestamp,Activity和Actor的元组。流程挖掘是一个新的和新兴领域,旨在分析事件日志,以发现,增强和改进业务流程,并检查运行时和设计时间业务流程之间的一致性。生成的大量事件日志存储在数据库中。关系数据库对某类应用程序表现良好。但是,存在某种应用程序,其中关系数据库无法划衡。为了解决可扩展性的挑战,NoSQL数据库系统出现了。从事件日志中发现流程模型(工作流程)是最具挑战性和重要的过程挖掘任务之一。 α-矿工算法是第一种和最广泛使用的过程发现技术之一。我们的目标是调查哪些数据库(关系或NoSQL)在流程挖掘下的进程发现申请表现更好。我们在数据库查询语言中实现了关于关系(面向行)和NoSQL(面向列)的数据库的α-Miner算法,以便我们的应用程序紧密耦合到数据库。我们进行行进数据库和NoSQL面向列数据库中α矿工算法的性能基准和比较。我们在加载大型数据集,磁盘使用情况,逐步执行时间和压缩技术中的各个方面提供了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号