首页> 外文期刊>Journal of classification >Variable Selection for Clustering and Classification
【24h】

Variable Selection for Clustering and Classification

机译:聚类和分类的变量选择

获取原文
获取原文并翻译 | 示例
           

摘要

As data sets continue to grow in size and complexity, effective and efficient techniques are needed to target important features in the variable space. Many of the variable selection techniques that are commonly used alongside clustering algorithms are based upon determining the best variable subspace according to model fitting in a stepwise manner. These techniques are often computationally intensive and can require extended periods of time to run; in fact, some are prohibitively computationally expensive for high-dimensional data. In this paper, a novel variable selection technique is introduced for use in clustering and classification analyses that is both intuitive and computationally efficient. We focus largely on applications in mixture model-based learning, but the technique could be adapted for use with various other clustering/classification methods. Our approach is illustrated on both simulated and real data, highlighted by contrasting its performance with that of other comparable variable selection techniques on the real data sets.
机译:随着数据集规模和复杂性的不断增长,需要有效而高效的技术来针对可变空间中的重要特征。与聚类算法共同使用的许多变量选择技术都是基于根据模型拟合逐步确定最佳变量子空间来进行的。这些技术通常占用大量计算资源,并且可能需要更长的时间才能运行。实际上,对于高维数据,有些计算量过大。本文介绍了一种新颖的变量选择技术,可用于聚类和分类分析,既直观又计算效率高。我们主要关注基于混合模型的学习中的应用程序,但是该技术可以适用于其他各种聚类/分类方法。通过在真实数据集上与其他可比较变量选择技术的性能进行对比,突出显示了我们在模拟数据和真实数据上的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号