...
首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >A Maximum Margin Approach for Semisupervised Ordinal Regression Clustering
【24h】

A Maximum Margin Approach for Semisupervised Ordinal Regression Clustering

机译:半监督序数聚类的最大余量法

获取原文
获取原文并翻译 | 示例

摘要

Ordinal regression (OR) is generally defined as the task where the input samples are ranked on an ordinal scale. OR has found a wide variety of applications, and a great deal of work has been done on it. However, most of the existing work focuses on supervised/semisupervised OR classification, and the semisupervised OR clustering problems have not been explicitly addressed. In real-world OR applications, labeling a large number of training samples is usually time-consuming and costly, and instead, a set of unlabeled samples can be utilized to set up the OR model. Moreover, although the sample labels are unavailable, we can sometimes get the relative ranking information of the unlabeled samples. This sample ranking information can be utilized to refine the OR model. Hence, how to build an OR model on the unlabeled samples and incorporate the sample ranking information into the process of improving the clustering accuracy remains a key challenge for OR applications. In this paper, we consider the semisupervised OR clustering problems with sample-ranking constraints, which give the relative ranking information of the unlabeled samples, and put forward a maximum margin approach for semisupervised OR clustering (SORC). On one hand, SORC seeks a set of parallel hyperplanes to partition the unlabeled samples into clusters. On the other hand, a loss function is put forward to incorporate the sample ranking information into the clustering process. As a result, the optimization function of SORC is formulated to maximize the margins of the closest neighboring clusters and meanwhile minimize the loss associated with the sample-ranking constraints. Extensive experiments on OR data sets show that the proposed
机译:通常将序数回归(OR)定义为输入样本按序数等级进行排序的任务。 OR已经发现了各种各样的应用程序,并且已经完成了很多工作。但是,大多数现有工作集中在有监督/半监督的OR分类上,并且尚未明确解决半监督或OR聚类问题。在现实世界的OR应用程序中,标记大量的训练样本通常是耗时且昂贵的,而可以使用一组未标记的样本来建立OR模型。此外,尽管样本标签不可用,但有时我们可以获得未标记样本的相对排名信息。该样本排名信息可用于完善OR模型。因此,如何在未标记的样本上建立OR模型并将样本排名信息纳入提高聚类精度的过程仍然是OR应用程序的主要挑战。在本文中,我们考虑了具有样本排序约束的半监督OR聚类问题,该问题给出了未标记样本的相对排名信息,并提出了一种最大余量的半监督OR聚类方法。一方面,SORC寻求一组平行的超平面,将未标记的样本划分为多个簇。另一方面,提出了损失函数,将样本排名信息纳入聚类过程。结果,制定了SORC的优化函数,以最大程度地增加最接近的相邻簇的余量,同时最大程度地减少与样本排序约束相关的损失。在OR数据集上的大量实验表明,提出的

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号