Density-based clustering of uncertain data

机译：基于密度的不确定数据聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In many different application areas, e.g. sensor databases, location based services or face recognition systems, distances between odjects have to be computed based on vague and uncertain data. Commonly, the distances between these uncertain object descriptions are expressed by one numerical distance value. Based on such single-valued distance functions standard data mining algorithms can work without any changes. In this paper, we propose to express the similarity between two fuzzy objects by distance probability functions. These fuzzy distance functions assign a probability value to each possible distance value. By integrating these fuzzy distance functions directly into data mining algorithms, the full information provided by these functions is exploited. In order to demonstrate the benefits of this general approach, we enhance the density-based clustering algorithm DBSCAN so that it can work directly on these fuzzy distance functions. In a detailed experimental evaluation based on artificial and real-world data sets, we show the characteristics and benefits of our new approach.

机译：在许多不同的应用领域，例如传感器数据库，基于位置的服务或面部识别系统，必须基于模糊和不确定的数据来计算目标之间的距离。通常，这些不确定对象描述之间的距离由一个数值距离值表示。基于这种单值距离函数，标准数据挖掘算法可以正常工作。在本文中，我们建议通过距离概率函数来表达两个模糊对象之间的相似性。这些模糊距离函数将概率值分配给每个可能的距离值。通过将这些模糊距离函数直接集成到数据挖掘算法中，可以利用这些函数提供的全部信息。为了证明这种通用方法的好处，我们增强了基于密度的聚类算法DBSCAN，使其可以直接在这些模糊距离函数上工作。在基于人工和真实数据集的详细实验评估中，我们展示了这种新方法的特点和优势。

著录项

来源
《》|2005年|P.672-677|共6页
会议地点
作者
Hans-Peter Kriegel; Martin Pfeifle; PHans-Peter Kriegel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
uncertain data;

机译：不确定的数据;

相似文献

外文文献
中文文献
专利

1. Novel density-based and hierarchical density-based clustering algorithms for uncertain data [J] . Zhang Xianchao, Liu Han, Zhang Xiaotong Neural Networks: The Official Journal of the International Neural Network Society . 2017,第期

机译：基于新的基于密度和分层密度的基于分层密度的不确定数据集群算法
2. M-FDBSCAN: A multicore density-based uncertain data clustering algorithm [J] . ATAKAN ERDEM, TAFLAN ?MRE GüNDEM Turkish Journal of Electrical Engineering and Computer Sciences . 2014,第1期

机译：M-FDBSCAN：一种基于多核密度的不确定数据聚类算法
3. A fast density-based data stream clustering algorithm with cluster centers self-determined for mixed data [J] . Chen Jin-Yin, He Hui-Hao Information Sciences: An International Journal . 2016,第Null期

机译：针对混合数据自行确定簇中心的基于密度的快速数据流聚类算法
4. Novel Density-Based Clustering Algorithms for Uncertain Data [C] . Xianchao Zhang, Han Liu, Xiaotong Zhang, AAAI Conference on Artificial Intelligence . 2014

机译：基于新的基于密度的聚类算法，用于不确定数据
5. Image reconstruction of muon tomographic data using a density-based clustering method. [D] . Perry, Kimberly B. 2015

机译：使用基于密度的聚类方法对μ子层析成像数据进行图像重建。
6. Automatic Clustering of Flow Cytometry Data with Density-Based Merging [O] . Guenther Walther, Noah Zimmerman, Wayne Moore, 2009

机译：流式细胞仪数据的自动聚类与基于密度的合并
7. Hierarchical Density-Based Clustering of Uncertain Data [O] . 2008

机译：基于分层密度的不确定数据聚类

Density-based clustering of uncertain data

摘要

著录项

相似文献

相关主题

期刊订阅