首页> 外文会议>International conference on frontiers of manufacturing and design science >Semi-supervised clustering algorithm based on small size of labeled data
【24h】

Semi-supervised clustering algorithm based on small size of labeled data

机译:基于标记数据量小的半监督聚类算法

获取原文

摘要

In many data mining domains,labeled data is very expensive to generate,how to make the best use of labeled data to guide the process of unlabeled clustering is the core problem of semi-supervised clustering.Most of semi-supervised clustering algorithms require a certain amount of labeled data and need set the values of some parameters,different values maybe have different results.In view of this,a new algorithm,called semi-supervised clustering algorithm based on small size of labeled data,is presented,which can use the small size of labeled data to expand labeled dataset by labeling their k-nearest neighbors and only one parameter.We demonstrate our clustering algorithm with three UCI datasets,compared with SSDBSCAN[4] and KNN,the experimental results confirm that accuracy of our clustering algorithm is close to that of KNN classification algorithm.
机译:在许多数据挖掘领域中,标记数据的生成非常昂贵,如何充分利用标记数据来指导非标记聚类的过程是半监督聚类的核心问题。大多数半监督聚类算法都需要一定的条件。针对这种情况,提出了一种新的算法,即基于小数据量的半监督聚类算法。小尺寸的标记数据通过标记k个近邻和仅一个参数来扩展标记数据集。我们用3个UCI数据集演示了聚类算法,与SSDBSCAN [4]和KNN相比,实验结果证实了该聚类算法的准确性接近于KNN分类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号