A Parallel Algorithm for Mining Density-Aware Distinguishing Sequential Patterns with Spark

机译：一种利用Spark挖掘密度识别序列模式的并行算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Distinguishing sequential pattern (DSP) mining is a useful technique to discriminate a set of sequences of one class against a set of sequences of another class. One kind of DSP that, considers the density concept in DSP mining (called density-aware DSP) has many applications in bioinformatics and computational biology. However, the previous method to mine density-aware DSPs suffers from the inefficient density computing. As a result, the previous method cannot deal with the datasets with large scale. To break this limitation, we design and implement a parallel mining method to discover density-aware DSPs using Spark, which is a popular framework for parallel computing. Our empirical study on real datasets demonstrates that our proposed method is efficient and scalable.

机译：区分顺序模式（DSP）挖掘是一种有用的技术，可将一个类别的序列集与另一类别的序列集区分开。一种在DSP挖掘中考虑密度概念的DSP（称为密度感知DSP）在生物信息学和计算生物学中有许多应用。但是，用于挖掘密度感知型DSP的先前方法存在密度计算效率低下的问题。结果，先前的方法不能大规模处理数据集。为了克服此限制，我们设计并实现了一种并行挖掘方法，以使用Spark（这是一种流行的并行计算框架）来发现密度感知型DSP。我们对真实数据集的实证研究表明，我们提出的方法是有效且可扩展的。

著录项

来源
《International Conference on Advanced Cloud and Big Data》|2016年|144-149|共6页
会议地点
作者
Pan Qin; Lei Duan; Tianqing Zhang; Pu Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Digital signal processing; Sparks; Data mining; Algorithm design and analysis; Itemsets; Electronic mail; Big data;

机译：数字信号处理;火花;数据挖掘;算法设计与分析;项目集;电子邮件;大数据;

相似文献

外文文献
中文文献
专利

1. Scalable and parallel sequential pattern mining using spark [J] . Yu Xiao, Li Qing, Liu Jin World Wide Web . 2019,第1期

机译：使用Spark进行可扩展的并行顺序模式挖掘
2. A STUDY OF SEQUENTIAL PATTERN MINING ALGORITHMS FOR USE IN DETECTION OF USER ACTIVITY PATTERNS [J] . MAXIM DUNAEV, KONSTANTIN ZAYTSEV, MIKHAIL TITOV Journal of Theoretical and Applied Information Technology . 2018,第13期

机译：用于检测用户活动模式的顺序模式挖掘算法的研究
3. The Impact of the Pattern-Growth Ordering on the Performances of Pattern Growth-Based Sequential Pattern Mining Algorithms [J] . Kenmogne Edith Belise Computer and information science . 2017,第1期

机译：模式增长顺序对基于模式增长的顺序模式挖掘算法性能的影响
4. A Parallel Algorithm for Mining Density-Aware Distinguishing Sequential Patterns with Spark [C] . Pan Qin, Lei Duan, Tianqing Zhang, International Conference on Advanced Cloud and Big Data . 2016

机译：采矿密度感知与火花的挖掘密度感知序列模式的并行算法
5. Fast algorithms for mining association rules and sequential patterns. [D] . Srikant, Ramakrishnan. 1996

机译：用于挖掘关联规则和顺序模式的快速算法。
6. An Efficient Incremental Mining Algorithm for Discovering Sequential Pattern in Wireless Sensor Network Environments [O] . Xin Lyu, Hongxu Ma 2019

机译：在无线传感器网络环境中发现顺序模式的高效增量挖掘算法
7. Mining Algorithms for Sequential Patterns in Parallel: Hash Based Approach [O] . Takahiko Shintani, Masaru Kitsuregawa 1998

机译：并行模式中顺序模式的挖掘算法：基于哈希的方法
8. SLPMiner: An Algorithm for Finding Frequent Sequential Patterns Using Length-Decreasing Support Constraint [R] . Seno, M. , Karypis, G. 2002

机译：sLpminer：一种利用长度减小支持约束寻找频繁序列模式的算法

A Parallel Algorithm for Mining Density-Aware Distinguishing Sequential Patterns with Spark

摘要

著录项

相似文献

相关主题

期刊订阅