首页> 外文期刊>Wirtschaftsinformatik >Optimizing Data Stream Representation: An Extensive Survey on Stream Clustering Algorithms
【24h】

Optimizing Data Stream Representation: An Extensive Survey on Stream Clustering Algorithms

机译:优化数据流表示形式:对流聚类算法的广泛调查

获取原文
获取原文并翻译 | 示例
       

摘要

Analyzing data streams has received considerable attention over the past decades due to the widespread usage of sensors, social media and other streaming data sources. A core research area in this field is stream clustering which aims to recognize patterns in an unordered, infinite and evolving stream of observations. Clustering can be a crucial support in decision making, since it aims for an optimized aggregated representation of a continuous data stream over time and allows to identify patterns in large and high-dimensional data. A multitude of algorithms and approaches has been developed that are able to find and maintain clusters over time in the challenging streaming scenario. This survey explores, summarizes and categorizes a total of 51 stream clustering algorithms and identifies core research threads over the past decades. In particular, it identifies categories of algorithms based on distance thresholds, density grids and statistical models as well as algorithms for high dimensional data. Furthermore, it discusses applications scenarios, available software and how to configure stream clustering algorithms. This survey is considerably more extensive than comparable studies, more up-to-date and highlights how concepts are interrelated and have been developed over time.
机译:在过去的几十年中,由于传感器,社交媒体和其他流数据源的广泛使用,分析数据流受到了相当大的关注。该领域的核心研究领域是流聚类,旨在识别无序,无限且不断发展的观测流中的模式。聚类可以成为决策过程中的关键支持,因为聚类的目的是随着时间的推移优化连续数据流的聚合表示,并允许识别大型和高维数据中的模式。已经开发了多种算法和方法,它们能够在具有挑战性的流传输场景中随时间查找和维护集群。这项调查探索,总结和归类了总共51种流聚类算法,并确定了过去几十年的核心研究线程。特别是,它基于距离阈值,密度网格和统计模型以及用于高维数据的算法来标识算法的类别。此外,它还讨论了应用场景,可用软件以及如何配置流聚类算法。这项调查比可比的研究要广泛得多,而且是最新的,并强调了概念是如何相互关联并随着时间的推移而发展的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号