首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >M-N SCATTER PLOTS TECHNIQUE FOR EVALUATING VARYING-SIZE CLUSTERS AND SETTING THE PARAMETERS OF BI-COPAM AND UNCLES METHODS
【24h】

M-N SCATTER PLOTS TECHNIQUE FOR EVALUATING VARYING-SIZE CLUSTERS AND SETTING THE PARAMETERS OF BI-COPAM AND UNCLES METHODS

机译:用于评估变化型簇的M-N散点图技术,并设定双副植物和叔章方法的参数

获取原文
获取外文期刊封面目录资料

摘要

The recently proposed UNCLES method has the ability to unify clustering results from multiple datasets under different types of external specifications. It can also tunably tighten the results such that many objects are unassigned from all of the clusters to obtain few tight clusters. Despite the success of this method, setting its parameters, such as the number of clusters (K) and the tuning parameters δ and (δ~+, δ~-), has never been automated. As its clusters vary in size, they cannot be validated by the existing validation indices. In this study we present a technique of validation based on our proposed M-N scatter plots. This technique has the ability to provide better fitness values for the clusters which include more objects while preserving their tightness. This well suits the nature of the results of UNCLES. We have applied this technique to a set of bacterial microarray datasets as well as a set of English vowels datasets. Our results demonstrate the success of the M-N plots in selecting the best few clusters out of a pool of clusters generated under varying K, δ, and (δ~+, δ~-) values. Our results also show that the best few clusters can be originated from different partitions, which shows the power of our technique in evaluating individual clusters rather than whole partitions. Finally, despite proposing this technique within the context of the UNCLES framework, it is readily applicable to other clustering results, especially when the parameters are not confidently predefined.
机译:最近提出的叔叔方法能够在不同类型的外部规范下统一多个数据集的聚类结果。它还可以调节可调节的结果,使得许多物体从所有簇都没有分配,以获得几个紧密的簇。尽管该方法的成功,但是设置其参数,例如簇(k)和调谐参数δ和(δ〜+,δ〜 - ),从未得到自动化。由于其群集的大小不同,因此现有的验证指数无法验证。在这项研究中,我们介绍了一种基于我们提出的M-N散点图的验证技术。该技术能够为包括更多物体的簇提供更好的适应性值,同时保持其紧密性。这很好适合叔叔结果的性质。我们已将这种技术应用于一组细菌微阵列数据集以及一组英语元音数据集。我们的结果展示了M-N曲线的成功选择在不同k,δ和(Δ〜Δ〜 - )值下产生的群池中的最佳少量簇。我们的结果还表明,最好的少数群集可以源自不同的分区,这表明我们在评估各个集群而不是整个分区时技术的功能。最后,尽管在叔册框架的背景下提出了这种技术,但很容易适用于其他聚类结果,尤其是当参数不自信地预定义时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号