首页> 外文期刊>Information Processing & Management >One-step spectral rotation clustering for imbalanced high-dimensional data
【24h】

One-step spectral rotation clustering for imbalanced high-dimensional data

机译:用于实施高维数据的一步谱旋转聚类

获取原文
获取原文并翻译 | 示例
       

摘要

The class distribution of imbalanced data sets is skewed in practical application. As traditional clustering methods mainly are designed for improving the overall learning performance, the majority class usually tends to be clustered and the minority class which is more valuable maybe ignored. Moreover, existing clustering methods can be limited for the performance of imbalanced and high-dimensional domains. In this paper, we present one-step spectral rotation clustering for imbalanced high-dimensional data (OSRCIH) by integrating self-paced learning and spectral rotation clustering in a unified learning framework, where sample selection and dimensionality reduction are simultaneously considered with mutual and iterative update. Specifically, the imbalance problem is considered by selecting the same number of training samples from each intrinsic group of the training data, where the sample-weight vector is obtained by self-paced learning. Moreover, dimensionality reduction is conducted by combining subspace learning and feature selection. Experimental analysis on synthetic datasets and real datasets showed that OSRCIH could recognize and enhance the weight of important samples and features so as to avoid the clustering method in favor of the majority class and to improve effectively the clustering performance.
机译:在实际应用中,不平衡数据集的类分布倾斜。由于传统聚类方法主要是为提高整体学习性能而设计的,大多数类通常往往是集群的,并且可能忽略更有价值的少数阶级。此外,现有的聚类方法可以限于不平衡和高维域的性能。在本文中,我们通过在统一的学习框架中积分自定节子学习和频谱旋转聚类来呈现用于非平衡高维数据(OSRCIH)的一步谱旋转聚类,其中采样选择和维度减少,同时考虑相互和迭代更新。具体地,通过选择来自每个内在基团的训练数据的训练样本来考虑不平衡问题,其中采样权重向量通过自花枢学习获得。此外,通过组合子空间学习和特征选择来进行维度降低。合成数据集和实际数据集的实验分析表明,奥斯烃可以识别和增强重要样品和特征的重量,以避免聚类方法支持多数阶级,并有效地改善聚类性能。

著录项

  • 来源
    《Information Processing & Management》 |2021年第1期|102388.1-102388.17|共17页
  • 作者单位

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

    Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin Guangxi China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Spectral rotation clustering; Dimensionality reduction; Imbalanced data;

    机译:光谱旋转聚类;减少维度;不平衡数据;
  • 入库时间 2022-08-18 22:53:25

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号