DBDC: Density Based Distributed Clustering

Abstract

Clustering has become an increasingly important task in modern application domains such as marketing and purchasing assistance, multimedia, and molecular biology, among many others. In most of these areas, the data are originally collected at different sites. In order to extract information from these data, they are merged at a central site and then clustered. In this paper, we propose a different approach. We cluster the data locally and extract suitable representatives from these clusters. These representatives are sent to a global server site, where we restore the complete clustering based on the local representatives. This approach is very efficient, because the local clusterings can be carried out quickly and independently of each other. Furthermore, we have low transmission cost, as the number of transmitted representatives is much smaller than the cardinality of the complete data set. Based on this small number of representatives, the global clustering can be done very efficiently. For both the local and the global clustering, we use a density-based clustering algorithm. The combination of the local and the global clustering forms our new DBDC (Density Based Distributed Clustering) algorithm. Furthermore, we discuss the complex problem of finding a suitable quality measure for evaluating distributed clusterings. We introduce two quality criteria, which are compared to each other and which allow us to evaluate the quality of our DBDC algorithm. In our experimental evaluation, we show that we do not have to sacrifice clustering quality in order to gain an efficiency advantage when using our distributed clustering approach.
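The local/global scheme described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how representatives are chosen or how the global parameters are set, so the greedy selection of core points in `local_representatives`, the choice `eps_global = 2 * EPS_LOCAL`, and `min_pts=1` on the server are all assumptions made for illustration, as are the function names and the toy data. The DBSCAN routine is a plain quadratic-time version used only to keep the example self-contained.

```python
from math import dist

NOISE = -1

def dbscan(points, eps, min_pts):
    """Minimal O(n^2) DBSCAN. Returns (labels, is_core); label -1 means noise."""
    n = len(points)
    neighbors = [[j for j in range(n) if dist(points[i], points[j]) <= eps]
                 for i in range(n)]
    is_core = [len(nb) >= min_pts for nb in neighbors]  # a point counts itself
    labels = [NOISE] * n
    cluster = 0
    for i in range(n):
        if labels[i] != NOISE or not is_core[i]:
            continue
        labels[i] = cluster
        stack = [i]
        while stack:
            p = stack.pop()
            if not is_core[p]:
                continue  # border points join a cluster but do not expand it
            for q in neighbors[p]:
                if labels[q] == NOISE:
                    labels[q] = cluster
                    stack.append(q)
        cluster += 1
    return labels, is_core

def local_representatives(points, eps, min_pts):
    """Cluster one site's data and keep a sparse subset of its core points."""
    labels, is_core = dbscan(points, eps, min_pts)
    reps = []
    for p, label, core in zip(points, labels, is_core):
        if label == NOISE or not core:
            continue
        # Assumption: a core point is dropped if a kept representative
        # already lies within eps of it.
        if all(dist(p, r) > eps for r in reps):
            reps.append(p)
    return reps

def global_clustering(site_reps, eps_global):
    """Server side: merge the representatives of all sites and cluster them."""
    merged = [r for reps in site_reps for r in reps]
    # Assumption: min_pts=1, so every representative is treated as dense.
    labels, _ = dbscan(merged, eps_global, min_pts=1)
    return merged, labels

# Toy data: one elongated cluster split across two sites, one compact
# cluster that lives only on site B, and one noise point on site A.
site_a = [(0.0, 0.0), (0.5, 0.0), (1.0, 0.0), (1.5, 0.0), (2.0, 0.0), (20.0, 0.0)]
site_b = [(2.5, 0.0), (3.0, 0.0), (3.5, 0.0), (4.0, 0.0), (4.5, 0.0),
          (10.0, 10.0), (10.5, 10.0), (10.0, 10.5), (10.5, 10.5)]

EPS_LOCAL, MIN_PTS = 0.6, 2
site_reps = [local_representatives(s, EPS_LOCAL, MIN_PTS) for s in (site_a, site_b)]
reps, global_labels = global_clustering(site_reps, eps_global=2 * EPS_LOCAL)

print(len(site_a) + len(site_b), "points,", len(reps), "representatives sent")
print(list(zip(reps, global_labels)))
```

In this toy run only the representatives leave each site, and the server reconnects the elongated cluster that was split between the two sites while keeping the compact cluster separate. Doubling the local radius on the server is one plausible way to let representatives of neighbouring dense regions link up; other settings may suit real data better.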

Bibliographic record

Source: International Conference on Extending Database Technology, 2004, 18 pages
Authors: Eshref Januzaj; Hans-Peter Kriegel; Martin Pfeifle
Original format: PDF
Chinese Library Classification: TP3-53

Similar literature

1. Balanced Density-Based Clustering Technique Based on Distributed Spatial Analysis in Wireless Sensor Network [J]. Walaa Abdellatief, Osama Youness, Hatem Abdelkader. International Journal of Wireless Information Networks. 2019, No. 2.

2. A big data driven distributed density based hesitant fuzzy clustering using Apache Spark with application to gene expression microarray [J]. Hosseini Behrooz, Kiani Kourosh. Engineering Applications of Artificial Intelligence. 2019, No. MARa.

3. A Bio-Inspired Solution to Cluster-Based Distributed Spectrum Allocation in High-Density Cognitive Internet of Things [J]. Li Jiaxun, Zhao Haitao, Hafid Abdelhakim Senhaji. IEEE Internet of Things Journal. 2019, No. 6.

4. DBDC: Density Based Distributed Clustering [C]. Eshref Januzaj, Hans-Peter Kriegel, Martin Pfeifle. International Conference on Extending Database Technology (EDBT 2004); 2004-03-14 to 2004-03-18; Heraklion, GR. 2004.

5. On density-based and representative-based spatial clustering algorithms [D]. Chen, Chun-Sheng. 2011.

6. A Novel Radar HRRP Recognition Method with Accelerated T-Distributed Stochastic Neighbor Embedding and Density-Based Clustering [O]. Hao Wu, Dahai Dai, Xuesong Wang. 2019.

7. DBDC: A Distributed Bus-Based Data Collection Mechanism for Maximizing Throughput and Lifetime in WSNs [O]. Chih-Yung Chang, Chung-Chih Lin, Cuijuan Shang. 2019.
