...
首页> 外文期刊>Parallel Algorithms and Applications >Parallel and distributed clustering framework for big spatial data mining
【24h】

Parallel and distributed clustering framework for big spatial data mining

机译:大空间数据挖掘的并行和分布式集群框架

获取原文
获取原文并翻译 | 示例

摘要

Clustering techniques are very attractive for identifying and extracting patterns of interests from datasets. However, their application to very large spatial datasets presents numerous challenges such as high-dimensionality, heterogeneity, and high complexity of some algorithms. Distributed clustering techniques constitute a very good alternative to the Big Data challenges (e.g., Volume, Variety, Veracity, and Velocity). In this paper, we developed and implemented a Dynamic Parallel and Distributed clustering (DPDC) approach that can analyse Big Data within a reasonable response time and produce accurate results, by using existing and current computing and storage infrastructure, such as cloud computing. The DPDC approach consists of two phases. The first phase is fully parallel and it generates local clusters and the second phase aggregates the local results to obtain global clusters. The aggregation phase is designed in such a way that the final clusters are compact and accurate while the overall process is efficient in time and memory allocation. DPDC was thoroughly tested and compared to well-known clustering algorithms BIRCH and CURE. The results show that the approach not only produces high-quality results but also scales up very well by taking advantage of the Hadoop MapReduce paradigm or any distributed system.
机译:聚类技术对于从数据集中识别和提取兴趣模式非常有吸引力。但是,它们在超大型空间数据集上的应用带来了许多挑战,例如高维性,异构性和某些算法的高复杂性。分布式集群技术可以很好地替代大数据挑战(例如,数量,多样性,准确性和速度)。在本文中,我们开发并实现了动态并行和分布式集群(DPDC)方法,该方法可以通过使用现有和现有的计算和存储基础架构(例如云计算)在合理的响应时间内分析大数据并产生准确的结果。 DPDC方法包括两个阶段。第一阶段是完全并行的,它生成局部聚类,第二阶段汇总局部结果以获得全局聚类。聚合阶段的设计方式是,最终群集紧凑而准确,而整个过程在时间和内存分配上都很高效。 DPDC经过了彻底的测试,并与著名的聚类算法BIRCH和CURE进行了比较。结果表明,该方法不仅可以产生高质量的结果,而且可以利用Hadoop MapReduce范例或任何分布式系统很好地进行扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号