Variable Selection for Clustering and Classification

Jeffrey L. Andrews; Paul D. McNicholas

首页> 外文期刊>Journal of classification >Variable Selection for Clustering and Classification

【24h】

Variable Selection for Clustering and Classification

机译：聚类和分类的变量选择

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

As data sets continue to grow in size and complexity, effective and efficient techniques are needed to target important features in the variable space. Many of the variable selection techniques that are commonly used alongside clustering algorithms are based upon determining the best variable subspace according to model fitting in a stepwise manner. These techniques are often computationally intensive and can require extended periods of time to run; in fact, some are prohibitively computationally expensive for high-dimensional data. In this paper, a novel variable selection technique is introduced for use in clustering and classification analyses that is both intuitive and computationally efficient. We focus largely on applications in mixture model-based learning, but the technique could be adapted for use with various other clustering/classification methods. Our approach is illustrated on both simulated and real data, highlighted by contrasting its performance with that of other comparable variable selection techniques on the real data sets.

机译：随着数据集规模和复杂性的不断增长，需要有效而高效的技术来针对可变空间中的重要特征。与聚类算法共同使用的许多变量选择技术都是基于根据模型拟合逐步确定最佳变量子空间来进行的。这些技术通常占用大量计算资源，并且可能需要更长的时间才能运行。实际上，对于高维数据，有些计算量过大。本文介绍了一种新颖的变量选择技术，可用于聚类和分类分析，既直观又计算效率高。我们主要关注基于混合模型的学习中的应用程序，但是该技术可以适用于其他各种聚类/分类方法。通过在真实数据集上与其他可比较变量选择技术的性能进行对比，突出显示了我们在模拟数据和真实数据上的方法。

著录项

来源
《Journal of classification》 |2014年第2期|共18页
作者
Jeffrey L. Andrews; Paul D. McNicholas;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自然科学理论与方法论;
关键词
Classification; Cluster analysis; High-dimensional data; Mixture models; Model-based clustering; Variable selection;

机译：分类;聚类分析;高维数据;混合模型;基于模型的聚类;变量选择;

相似文献

外文文献
中文文献
专利

1. Variable Selection for Clustering and Classification [J] . Jeffrey L. Andrews, Paul D. McNicholas Journal of classification . 2014,第2期

机译：聚类和分类的变量选择
2. Papers on normalization, variable selection, classification or clustering of microarray data [J] . David M. Rocke Trey Ideker Olga Troyanskaya John Quackenbush and Joaquin Dopazo Bioinformatics . 2009,第6期

机译：关于微阵列数据的归一化，变量选择，分类或聚类的论文
3. Selection of Variables for Cluster Analysis and Classification Rules [J] . Ricardo Fraiman, Ana Justel, Marcela Svarc Journal of the American statistical association . 2008,第483期

机译：聚类分析和分类规则的变量选择
4. A Variable Clustering Method and it's Applying to Variable Selection in Multiple Linear Regression Model [C] . XuYuan The proceedings of 2010 international conference on probability and statistics of the International Institute for General Systems Studies.;vol. 2.;Applications on probability and statistics . 2010

机译：变量聚类方法及其在多元线性回归模型中的变量选择
5. Contributions to classification with missing values and variable selection in clustering [D] . Li, Li 2008

机译：聚类中缺失值和变量选择对分类的贡献
6. Variable Selection in Nonparametric Classification via Measurement Error Model Selection Likelihoods [O] . L. A. Stefanski, Yichao Wu, Kyle White -1

机译：通过测量误差模型选择可能性的非参数分类中的变量选择
7. Variable Selection for Clustering and Classification∗ [O] . Jeffrey L. Andrews, Paul D. Mcnicholas 2016

机译：聚类和分类的变量选择*

Variable Selection for Clustering and Classification

摘要

著录项

相似文献

相关主题

期刊订阅