An Effective Clustering Mechanism for Uncertain Data Mining Using Centroid Boundary in UKmeans

Abstract

Object errors affect both the time cost and the effectiveness of uncertain data clustering. To reduce the time cost and increase effectiveness, we propose two mechanisms for UKmeans, a centroid-based clustering approach. The first mechanism is an improved similarity measure. Similarity directly affects both time cost and effectiveness. For example, similarity calculations that use integration favor clustering effectiveness but ignore the time cost. Conversely, simplified similarity calculations address the time cost but ignore effectiveness. To account for both, we use a simplified similarity to reduce the time cost and add two factors, the intersection and the density of clusters, to increase clustering effectiveness. The intersection factor increases an object's degree of belonging when a cluster overlaps the object. The density factor prevents objects from being attracted to clusters that have large errors. The second mechanism is the definition of a centroid boundary. In clustering, the position of a cluster centroid lies within an average range determined by the errors of the objects that belong to the cluster. A large average range, however, lowers clustering effectiveness. To shrink this range, we propose the square root boundary mechanism, which limits the upper bound on the possible positions of centroids. Experimental results suggest that the two mechanisms perform well in both time cost and effectiveness, and that they complement existing UKmeans approaches to uncertain data clustering.
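
To make the "simplified similarity" idea concrete, here is a minimal sketch of a baseline centroid-based clustering loop for uncertain objects. It is not the method proposed in the paper: the intersection factor, the density factor, and the square root boundary are not specified in the abstract and are not reproduced here. The sketch assumes each uncertain object is summarized by a per-dimension mean and variance, and it uses the standard identity E||X - c||^2 = ||E[X] - c||^2 + sum of the per-dimension variances, which avoids integrating over the object's probability density function. The function names `expected_sq_distance` and `uk_means` are hypothetical.

```python
import numpy as np

def expected_sq_distance(means, variances, centroids):
    """Expected squared Euclidean distance from each uncertain object to each centroid.

    Uses E||X - c||^2 = ||mu - c||^2 + sum_j var_j, so no integration is needed.
    means, variances: arrays of shape (n, d); centroids: array of shape (k, d).
    Returns an array of shape (n, k).
    """
    diff = means[:, None, :] - centroids[None, :, :]        # shape (n, k, d)
    return (diff ** 2).sum(axis=2) + variances.sum(axis=1)[:, None]

def uk_means(means, variances, k, n_iter=20, seed=0):
    """Baseline clustering loop (assumed setup, not the paper's improved mechanism)."""
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    rng = np.random.default_rng(seed)
    # Start from k distinct objects; fancy indexing returns a copy.
    centroids = means[rng.choice(len(means), size=k, replace=False)]
    for _ in range(n_iter):
        # Assign every object to the centroid with the smallest expected distance.
        labels = expected_sq_distance(means, variances, centroids).argmin(axis=1)
        # Move each centroid to the average of its members' means.
        for j in range(k):
            members = means[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return labels, centroids

if __name__ == "__main__":
    pts = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
    var = np.full((6, 2), 0.5)
    labels, centroids = uk_means(pts, var, k=2)
    print(labels, centroids)
```

In this baseline the variance term is the same for every centroid, so it never changes which cluster an object is assigned to. That is one way to read the abstract's remark that simplified similarities save time but lose effectiveness, and it is the gap the paper's additional factors are meant to address.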


Bibliographic record

Source
International Computer Symposium | 2016 | 746p | 6 pages
Authors
Kuan-Teng Liao; Chuan-Ming Liu
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
Upper bound; Lead; Probability density function; Euclidean distance; Force; Computers; Data mining


Similar literature

1. Mechanisms to improve clustering uncertain data with UKmeans [J]. Chuan-Ming Liu, Zhendong Niu, Kuan-Teng Liao. Data & Knowledge Engineering. 2018, issue JULa
2. Ensembled Heuristic Iterative Expected Maximization with BrownBoost Data Clustering for Uncertain Data Mining [J]. Muruganantham S., Elango N. M. International Journal of Applied Engineering Research. 2019, issue 2aPta1
3. Effective Intra Mode Prediction of 3D-HEVC System based on Big Data Clustering and Data Mining [J]. Jinchao Zhao, Shuaichao Wei, Qiuwen Zhang. International Journal of Performability Engineering. 2019, issue 12
4. An Effective Clustering Mechanism for Uncertain Data Mining Using Centroid Boundary in UKmeans [C]. Kuan-Teng Liao, Chuan-Ming Liu. 2016 International Computer Symposium. 2016
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining [D]. Hao, Boyu. 2010
6. Mining of high utility-probability sequential patterns from uncertain databases [O]. Binbin Zhang, Jerry Chun-Wei Lin, Philippe Fournier-Viger, 2011
7. Uncertain centroid based partitional clustering of uncertain data [O]. Francesco Gullo, Andrea Tagarelli. 2012
