Distributed clustering of ubiquitous data streams

Rodrigues Pedro Pereira; Gama Joao

首页> 外文期刊>Wiley interdisciplinary reviews. Data mining and knowledge discovery >Distributed clustering of ubiquitous data streams

【24h】

Distributed clustering of ubiquitous data streams

机译：普遍存在的数据流的分布式集群

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays information is generated and gathered from distributed streaming data sources, stressing communications and computing infrastructure, making it hard to transmit, compute, and store. Knowledge discovery from ubiquitous data streams has become a major goal for all sorts of applications, mostly based on unsupervised techniques such as clustering. Two subproblems exist: clustering streaming data observations and clustering streaming data sources. The former searches for dense regions of the data space, identifying hot spots where data sources tend to produce data, while the latter finds groups of sources that behave similarly over time. In order to better assess the current status of this topic, this article presents a thorough review on distributed algorithms addressing either of the subproblems. We characterize clustering algorithms for ubiquitous data streams, discussing advantages and disadvantages of distributed procedures. Overall, distributed stream clustering methods improve communication ratios, processing speed, and resources consumption, while achieving similar clustering validity as the centralized counterparts. (C) 2013 John Wiley & Sons, Ltd.

机译：如今，信息是从分布式流数据源生成和收集的，这给通信和计算基础结构带来了压力，使其难以传输，计算和存储。从无处不在的数据流中发现知识已经成为各种应用程序的主要目标，这些应用程序大多基于诸如群集之类的无监督技术。存在两个子问题：对流数据观测进行聚类和对流数据源进行聚类。前者搜索数据空间的密集区域，确定数据源倾向于生成数据的热点，而后者则发现随时间变化表现相似的源组。为了更好地评估此主题的当前状态，本文对解决任一子问题的分布式算法进行了全面的回顾。我们表征了无处不在的数据流的聚类算法，讨论了分布式过程的优缺点。总体而言，分布式流聚类方法提高了通信比率，处理速度和资源消耗，同时实现了与集中式同类方法相似的聚类有效性。（C）2013 John Wiley＆Sons，Ltd.

著录项

来源
《Wiley interdisciplinary reviews. Data mining and knowledge discovery》 |2014年第1期|共17页
作者
Rodrigues Pedro Pereira; Gama Joao;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Distributed clustering of ubiquitous data streams [J] . Rodrigues Pedro Pereira, Gama Joao Wiley interdisciplinary reviews. Data mining and knowledge discovery . 2014,第1期

机译：普遍存在的数据流的分布式集群
2. A fuzzy approach for interpretation of ubiquitous data stream clustering and its application in road safety [J] . Osnat Horovitz, Shonali Krishnaswamy, Mohamed Medhat Gaber Intelligent data analysis . 2007,第1期

机译：模糊解释泛在数据流聚类的方法及其在道路安全中的应用
3. Adaptive Fuzzy Clustering of Short Time Series with Unevenly Distributed Observations in Data Stream Mining Tasks [J] . Yevgeniy Bodyanskiy, Olena Vynokurova, Ilya Kobylin, Information Technology and Management Science . 2016,第1期

机译：数据流挖掘任务中具有不均匀分布观测值的短时间序列的自适应模糊聚类
4. Resource-Aware Density-and-Grid-Based Clustering in Ubiquitous Data Streams [C] . Ching-Ming Chao Advanced Information Networking and Applications Workshops (WAINA), 2012 26th International Conference on . 2012

机译：无处不在的数据流中基于资源感知的基于密度和网格的群集
5. Approximate Clustering Algorithms for High Dimensional Streaming and Distributed Data [D] . Carraher, Lee A. 2018

机译：高维流和分布式数据的近似聚类算法
6. A Distributed Stream Processing Middleware Framework for Real-Time Analysis of Heterogeneous Data on Big Data Platform: Case of Environmental Monitoring [O] . Adeyinka Akanbi, Muthoni Masinde 2020

机译：大数据平台上异构数据实时分析的分布式流处理中间件框架：环境监测案例
7. Distributed clustering of ubiquitous data streams [O] . Pedro Pereira Rodrigues, João Gama 2013

机译：普遍存在数据流的分布式聚类

Distributed clustering of ubiquitous data streams

摘要

著录项

相似文献

相关主题

期刊订阅