首页> 外文会议>International Conference on Information and Knowledge Engineering >An Approach for Mining Frequent Sequences in Distributed Environment
【24h】

An Approach for Mining Frequent Sequences in Distributed Environment

机译:分布式环境中频繁序列采矿的方法

获取原文
获取外文期刊封面目录资料

摘要

Discovering meaningful information from data is what is known as data mining. By meaningful information we mean many mining problems such as association rules, frequent sequences, or clusters. So far, attention in a data mining process has always focused on extracting information from centralized data or data physically located at one single site. However, there are some applications in which data distribution in different sites is a need or mandatory. In this case, the process embraces additional complexity. One aspect of complexity concerns with how to adapt or innovate mining algorithms to extract information from distributed data rather than central one. Another problem, concerns with setting a mechanism for communication involved in the distributed environment without incurring significant overhead. This paper is a preliminary study for mining distributed sequences in light of current mining algorithms. Based on detailed comparisons of algorithms in sequences mining area, we propose a distributed approach to follow for implementation of sequences discovery.
机译:从数据中发现有意义的信息是所谓的数据挖掘。通过有意义的信息,我们的意思是许多挖掘问题,例如关联规则,频繁序列或集群。到目前为止,在数据挖掘过程中的注意力始终专注于从物理位于一个单一站点的集中数据或数据中提取信息。但是,存在一些应用程序,其中不同站点中的数据分布是需要或强制性的。在这种情况下,该过程包括额外的复杂性。复杂性的一个方面有关如何适应或创新挖掘算法以从分布式数据而不是中央数据提取信息。另一个问题,对设置分布式环境中涉及的通信机制而不产生显着的开销的问题。本文是初步研究当前采矿算法的挖掘分布序列。基于序列矿区算法的详细比较,提出了一种分布式方法来遵循序列发现的实施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号