Dependency of Constrained Clustering of Transaction Data on Known Data Distribution

机译：关于已知数据分布的交易数据的约束群集的依赖性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most well-known partitioning clustering algorithms adopt an iterative procedure to converge to the stable status. One problem is that the quality of clustering and execution time is especially sensitive to initial conditions (e.g. initial cluster centers and cluster number). In addition, the method used to measure similarity between two transaction data is also an important factor. In general, the similarity method is established in advance and usually employs metric-based distance measuring, which does not consider the variation in the content. The disadvantage is that an analyst is unable to modify the measuring method to suit the need of a particular analysis. In this paper, therefore, we propose a novel constrained clustering algorithm called CCKD (short for Constrained Clustering depend on Known data Distribution). With CCKD, the analyst is able to specify the constrains for measuring similarity that set conditions on capturing clusters. In addition, our empirical results indicate that CCKD is an effective and stable algorithm without any iterative procedure.

机译：大多数众所周知的分区聚类算法采用迭代过程来收敛到稳定状态。一个问题是聚类和执行时间的质量对初始条件特别敏感（例如，初始群集中心和群集号）。此外，用于测量两个交易数据之间相似性的方法也是一个重要因素。通常，预先建立相似性方法，通常采用基于度量的距离测量，这不考虑内容的变化。缺点是分析师无法修改测量方法以满足特定分析的需要。因此，在本文中，我们提出了一种名为CCKD的新型受限聚类算法（限制群集的短路取决于已知的数据分布）。使用CCKD，分析师能够指定测量在捕获集群上设置条件的相似性的约束。此外，我们的经验结果表明CCKD是一种有效且稳定的算法，没有任何迭代程序。

著录项

来源
《IEEE Joint Conference on E-Commerce Technology》|2007年||共7页
会议地点
作者
Chang Hui-Chu; Chen Ming-Syan; CSMDL;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词

相似文献

外文文献
中文文献
专利

1. Detecting intrusion transactions in databases using data item dependencies and anomaly analysis [J] . Sattar Hashemi, Ying Yang, Davoud Zabihzadeh, Expert Systems . 2008,第5期

机译：使用数据项依赖性和异常分析来检测数据库中的入侵事务
2. Vine copulas for mixed data : multi-view clustering for mixed data beyond meta-Gaussian dependencies [J] . Tekumalla Lavanya Sita, Rajan Vaibhav, Bhattacharyya Chiranjib Machine Learning . 2017,第9a10期

机译：混合数据的藤蔓copulas：超出元高斯依存关系的混合数据的多视图聚类
3. A Data Cleansing Method for Clustering Large-Scale Transaction Databases [J] . Woong-Kee LOH, Yang-Sae MOON, Jun-Gyu KANG IEICE transactions on information and systems . 2010,第11期

机译：集群大型交易数据库的数据清理方法
4. Dependency of Constrained Clustering of Transaction Data on Known Data Distribution [C] . Chang, Hui-Chu, Chen, . 2007

机译：交易数据约束聚类对已知数据分布的依赖性
5. BAYESIAN IMAGE PROCESSING OF DATA FROM CONSTRAINED SOURCE DISTRIBUTIONS. [D] . LIANG, ZHENGRONG. 1987

机译：来自约束源分布的数据的贝叶斯图像处理。
6. Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions [O] . E Andres Houseman, Brock C Christensen, Ru-Fang Yeh, 2008

机译：DNA甲基化阵列数据的基于模型的聚类：针对β分布混合出现的高维数据的递归划分算法
7. Vine copulas for mixed data : multi-view clustering for mixed data beyond meta-Gaussian dependencies [O] . Tekumalla, Lavanya Sita, Rajan, Vaibhav, Bhattacharyya, Chiranjib 2017

机译：混合数据的藤蔓copulas：超出元高斯依存关系的混合数据的多视图聚类

Dependency of Constrained Clustering of Transaction Data on Known Data Distribution

摘要

著录项

相似文献

相关主题

期刊订阅