Pairwise Data Clustering and Applications

机译：成对数据聚类和应用程序

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Data clustering is an important theoretical topic and a sharp tool for various applications. Its main objective is to partition a given data set into clusters such that the data within the same cluster are "more" similar to each other with respect to certain measures. In this paper, we study the pairwise data clustering problem with pairwise similarity/dissimilarity measures that need not satisfy the triangle inequality. By using a criterion, called the minimum normalized cut, we model the pairwise data clustering problem as a graph partition problem. The graph partition problem based on minimizing the normalized cut is known to be NP-hard. We present a ((4 + o(1)) ln n)-approximation polynomial time algorithm for the minimum normalized cut problem. We also give a more efficient algorithm for this problem by sacrificing the approximation ratio slightly. Further, our scheme achieves a ((2 + o(1)) ln n)-approximation polynomial time algorithm for computing the sparsest cuts in edge-weighted and vertex-weighted undirected graphs, improving the previously best known approximation ratio by a constant factor.

机译：数据集群是各种应用的重要理论主题和急剧的工具。其主要目标是将给定的数据分为集群，使得同一群集内的数据相对于某些措施相似地与彼此类似。在本文中，我们使用不需要满足三角形不等式的成对相似性/不同措施来研究成对数据聚类问题。通过使用称为最小归一化切割的标准，我们将成对数据聚类问题模拟为图形分区问题。已知基于最小化归一化切割的图分区问题是NP-HARD。我们呈现（（4 + O（1））ln n） - 用于最小归一化切割问题的批量多项式时间算法。我们还通过稍微牺牲近似比来给出更有效的算法。此外，我们的方案实现了（（2 + O（1））Ln N） - 用于计算边缘加权和顶点加权的无向图中的稀疏性切割的多项式时间算法，通过恒定因子提高先前最佳已知的近似比。

著录项

来源
《Annual international conference on computing and combinatorics》|2003年||共12页
会议地点
作者
Xiaodong Wu; Danny Z. Chen; James J. Mason; Steven R. Schmid;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法;
关键词

相似文献

外文文献
中文文献
专利

1. EFFICIENT APPROXIMATION ALGORITHMS FOR PAIRWISE DATA CLUSTERING AND APPLICATIONS [J] . Xiaodong Wu, Danny Z. Chen, James J. Mason, International journal of computational geometry & applications . 2004,第1a2期

机译：对数据聚类的有效逼近算法和应用
2. Generate pairwise constraints from unlabeled data for semi-supervised clustering [J] . Masud Md Abdul, Huang Joshua Zhexue, Zhong Ming, Data & Knowledge Engineering . 2019,第Sepa期

机译：从未标记的数据生成成对约束以进行半监督聚类
3. Pairwise gene GO-based measures for biclustering of high-dimensional expression data [J] . Juan A. Nepomuceno, Alicia Troncoso, Isabel A. Nepomuceno-Chamorro, BioData Mining . 2018,第1期

机译：基于成对基因GO的高维表达数据聚类的措施
4. Pairwise Data Clustering and Applications [C] . Xiaodong Wu, Danny Z. Chen, James J. Mason, Computing and Combinatorics . 2003

机译：成对数据聚类和应用
5. Data clustering with pairwise constraints. [D] . Yi, Jinfeng. 2014

机译：具有成对约束的数据聚类。
6. Fault Diagnosis by Multisensor Data: A Data-Driven Approach Based on Spectral Clustering and Pairwise Constraints [O] . Massimo Pacella, Gabriele Papadia 2020

机译：多传感器数据的故障诊断：基于频谱聚类和成对约束的数据驱动方法
7. Path Based Pairwise Data Clustering with Application to Texture Segmentation [O] . Bernd Fischer, Thomas Zoller, Joachim M. Buhmann, 2001

机译：基于路径的成对数据聚类及其在纹理分割中的应用
8. Application of Cluster Analysis to Aerometric Data. Volume I. Part 1: Clustering, Validation, and Classification of Data. Part 2: Investigation and Report of Cluster Analysis [R] . Crutcher, H. L. , Nelson, C. , Fairbairn, B. , 1980

机译：聚类分析在航空数据中的应用。第一部分：数据的聚类，验证和分类。第2部分：聚类分析的调查和报告

Pairwise Data Clustering and Applications

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅