A Fast Clustering Algorithm for Data with a Few Labeled Instances

Abstract

The diameter of a cluster is the maximum distance between pairs of instances within that cluster, and the split of a cluster is the minimum distance between an instance inside the cluster and an instance outside it. Given a few labeled instances, this paper makes two contributions. First, we present a simple and fast clustering algorithm with the following property: if the ratio of the minimum split to the maximum diameter (RSD) of the optimal solution is greater than one, the algorithm returns optimal solutions for three clustering criteria. Second, we study the metric learning problem of learning a distance metric that makes the RSD as large as possible. Compared with existing metric learning algorithms, one of our metric learning algorithms is computationally efficient: it is a linear programming model rather than the semidefinite programming model used by most existing algorithms. We demonstrate empirically that the supervision and the learned metric can improve the clustering quality.
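
The definitions of diameter, split, and RSD in the abstract can be made concrete with a small sketch. This is not the paper's clustering or metric learning algorithm; it only evaluates the three quantities for a given labeling. It assumes Euclidean distance, at least two clusters, and at least one cluster with two or more instances, and the function names `diameter`, `split`, and `rsd` are hypothetical.

```python
from itertools import combinations
from math import dist  # Euclidean distance (assumed metric), Python 3.8+


def diameter(points, labels, cluster):
    """Maximum distance between two instances in the same cluster."""
    members = [p for p, l in zip(points, labels) if l == cluster]
    # A singleton cluster has no pairs, so its diameter is taken as 0.
    return max((dist(a, b) for a, b in combinations(members, 2)), default=0.0)


def split(points, labels, cluster):
    """Minimum distance between an instance inside the cluster and one outside it."""
    inside = [p for p, l in zip(points, labels) if l == cluster]
    outside = [p for p, l in zip(points, labels) if l != cluster]
    return min(dist(a, b) for a in inside for b in outside)


def rsd(points, labels):
    """Ratio of the minimum split to the maximum diameter over all clusters."""
    clusters = set(labels)
    min_split = min(split(points, labels, c) for c in clusters)
    max_diameter = max(diameter(points, labels, c) for c in clusters)
    return min_split / max_diameter


# Two tight, well-separated clusters: every diameter is 1, every split is 5.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.0)]
labels = [0, 0, 1, 1]
print(rsd(points, labels))  # 5.0
```

In this toy labeling the RSD is greater than one, which is the condition under which the abstract states that the algorithm returns optimal solutions.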

Bibliographic information

Journal: Computational Intelligence and Neuroscience
Authors: Jinfeng Yang; Yong Xiao; Jiabing Wang; Qianli Ma; Yanhua Shen
Year (volume): 2015 (2015)
Year: 2015
Page number: 196098
Total pages: 10
Original format: PDF
Chinese Library Classification: Neuroscience

Similar literature

1. A Fast Clustering Algorithm for Data with a Few Labeled Instances [J]. Jinfeng Yang, Yong Xiao, Jiabing Wang. Computational Intelligence and Neuroscience, 2015, issue 1.
2. A Fast Clustering Algorithm for Data with a Few Labeled Instances [J]. Jinfeng Yang, Yong Xiao, Jiabing Wang. Computational Intelligence and Neuroscience, 2015, issue Pta1.
3. Automatically discovering clusters of algorithm and problem instance behaviors as well as their causes from experimental data, algorithm setups, and instance features [J]. Weise Thomas, Wang Xiaofeng, Qi Qi. Applied Soft Computing, 2018.
4. A New Incremental Growing Neural Gas Algorithm Based on Clusters Labeling Maximization: Application to Clustering of Heterogeneous Textual Data [C]. Jean-Charles Lamirel, Zied Boulila, Maha Ghribi. IEA/AIE 2010; International conference on industrial engineering and other applications of applied intelligent systems, 2010.
5. Fast conceptual clustering algorithms for data mining and visualization [D]. Moustafa, Rida E. A. 2001.
6. A Fast Cluster Motif Finding Algorithm for ChIP-Seq Data Sets [O]. Yipu Zhang, Ping Wang.
7. Fast Data Reduction With Granulation-Based Instances Importance Labeling [O]. Xiaoyan Sun, Lian Liu, Cong Geng. 2019.
