首页> 外文会议>IEEE International Conference on Big Data >Distributed Mining of Spatial High Utility Itemsets in Very Large Spatiotemporal Databases using Spark In-Memory Computing Architecture

【24h】

Distributed Mining of Spatial High Utility Itemsets in Very Large Spatiotemporal Databases using Spark In-Memory Computing Architecture

机译：使用Spark In-Memory Computing Architecture在非常大的时空数据库中分布挖掘空间高实用程序项集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Finding Spatial High Utility Itemsets (SHUIs) in a spatiotemporal database is a challenging problem of great importance in many real-world applications. Most previous works focused on the sequential discovery of SHUIs in a database running on a single machine. Consequently, these works are not suitable for big data (or cloud-based) applications as they suffer from the scalability and fault tolerant problems. This paper proposes several novel pruning techniques to reduce the search space and present a more flexible distributed algorithm to find all desired itemsets from the database using Spark in-memory computing architecture. Our algorithm inherits several advantages of Spark, including low communication cost, fault tolerance, and high scalability. Experimental results demonstrate that the proposed algorithm has good scalability and performance on very large databases. Finally, we present a real-world navigation application in which SHUIs generated from the traffic congestion data have been employed to recommend alternative routes to the users.

机译：在Spatiotemporal数据库中寻找空间高实用项目集（Shuis）是许多真实应用中非常重要的挑战性问题。最先前的作品专注于Shuis在单台机器上运行的数据库中的顺序发现。因此，这些作品不适用于大数据（或基于云的）应用，因为它们遭受可伸缩性和容错问题。本文提出了几种新颖的修剪技术来减少搜索空间，并呈现更灵活的分布式算法，以使用火花内存计算架构从数据库中找到所有所需的项目集。我们的算法继承了火花的几个优点，包括低通信成本，容错和高可扩展性。实验结果表明，所提出的算法在非常大的数据库中具有良好的可扩展性和性能。最后，我们提出了一个真实的导航应用程序，其中已经采用了从流量拥塞数据生成的SHUIS推荐给用户的替代路由。

著录项

来源
《IEEE International Conference on Big Data 》|2020年|4724-4733|共10页
会议地点
作者
R. Uday Kiran; Sadanori Ito; Minh-Son Dao; Koji Zettsu; Cheng-Wei Wu; Yukata Watanobe; Incheon Paik; Truong Cong Thang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Fault tolerance; Itemsets; Scalability; Fault tolerant systems; Big Data; Spatiotemporal phenomena; Sparks;

机译：容错;项目集;可扩展性;容错系统;大数据;时尚现象;火花;

相似文献

外文文献
中文文献
专利

1. Implementation of Efficient Algorithm for Mining High Utility Itemsets in Distributed and Dynamic Database [J] . G. Saranya, A.Deepakkumar International Journal of Engineering and Technology . 2014 ,第5期

机译：分布式动态数据库中高效实用项集挖掘的高效算法的实现
2. Multi-level Utility Mining: Retrieval of High Utility Itemsets in a Transaction Database [J] . Sivamathi C., Vijayarani S. Computers and Electrical Engineering . 2019 ,第期

机译：多级实用程序挖掘：在交易数据库中检索高实用程序项集
3. Sequential Pattern Mining Databases using High Utility Rare Itemset Mining Algorithm using Temporal [J] . Priyadharshini S. P., Hemalatha M. International Journal of Applied Engineering Research . 2019 ,第8aPta1期

机译：使用时间使用高实用工具稀有项目组挖掘算法的顺序模式挖掘数据库
4. Parallel Mining of Top-k High Utility Itemsets in Spark In-Memory Computing Architecture [C] . Chun-Han Lin, Cheng-Wei Wu, JianTao Huang, Pacific-Asia Conference on Knowledge Discovery and Data Mining . 2019

机译：Spark内存计算架构中Top-k高实用项目集的并行挖掘
5. Mining Frequent Itemsets Using Improved Apriori on Spark [D] . Khandelwal, Ashutosh. 2017

机译：在Spark上使用改进的Apriori挖掘频繁项集
6. HUIL-TN HUI-TN: Mining high utility itemsets based on pattern-growth [O] . Le Wang, Shui Wang 2021

机译：Huil-Tn＆Hui-TN：基于模式增长的矿业高实用项目集
7. An Efficient Distributed Frequent Itemset Mining Algorithm Based on Spark for Big Data [O] . Yassir Rochd, Imad Hafidi 2019

机译：基于大数据的火花的高效分布式频繁项目集挖掘算法
8. Distributed Database Components in a DBMS (Database Management System) Component Architecture [R] . Manola, F. A. 1984

机译：DBms（数据库管理系统）组件体系结构中的分布式数据库组件

Distributed Mining of Spatial High Utility Itemsets in Very Large Spatiotemporal Databases using Spark In-Memory Computing Architecture

摘要

著录项

相似文献

相关主题

期刊订阅