Minimum Information Loss Cluster Analysis for Categorical Data

Abstract

The EM algorithm has been used repeatedly to identify latent classes in categorical data by estimating finite distribution mixtures of product components. Unfortunately, the underlying mixtures are not uniquely identifiable and, moreover, the estimated mixture parameters are starting-point dependent. For this reason we use the latent class model only to define a set of "elementary" classes by estimating a mixture of a large number of components. We propose a hierarchical "bottom up" cluster analysis based on unifying the elementary latent classes sequentially. The clustering procedure is controlled by a minimum information loss criterion.
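
The bottom-up unification of elementary classes can be illustrated with a minimal sketch. The abstract does not state the exact form of the information loss criterion, so the version below is an assumption: it measures information as the empirical mutual information I(X; M) = H(M) - H(M | X) computed from posterior probabilities P(m | x_n), and at each step it unifies the pair of classes whose merging reduces that quantity least. The posteriors are assumed to come from a previously fitted mixture (the EM step is not shown), and the function names `mutual_information`, `merge_columns` and `agglomerate` are hypothetical, not taken from the paper.

```python
import math
from itertools import combinations


def mutual_information(posteriors):
    """Empirical I(X; M) = H(M) - H(M | X) from rows of P(m | x_n)."""
    n = len(posteriors)
    k = len(posteriors[0])
    # Class weights estimated as the average posterior over the data.
    weights = [sum(row[m] for row in posteriors) / n for m in range(k)]
    h_m = -sum(w * math.log(w) for w in weights if w > 0)
    h_m_given_x = -sum(
        p * math.log(p) for row in posteriors for p in row if p > 0
    ) / n
    return h_m - h_m_given_x


def merge_columns(posteriors, a, b):
    """Unify classes a and b by adding their posterior probabilities."""
    merged = []
    for row in posteriors:
        new_row = [p for m, p in enumerate(row) if m not in (a, b)]
        new_row.append(row[a] + row[b])  # merged class goes last
        merged.append(new_row)
    return merged


def agglomerate(posteriors, n_clusters):
    """Greedy bottom-up merging (assumed criterion: least loss of I(X; M)).

    Returns the clusters as lists of elementary class indices,
    together with the merged posterior table.
    """
    clusters = [[m] for m in range(len(posteriors[0]))]
    while len(clusters) > n_clusters:
        current = mutual_information(posteriors)
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            loss = current - mutual_information(merge_columns(posteriors, a, b))
            if best is None or loss < best[0]:
                best = (loss, a, b)
        _, a, b = best
        posteriors = merge_columns(posteriors, a, b)
        merged_cluster = clusters[a] + clusters[b]
        clusters = [c for m, c in enumerate(clusters) if m not in (a, b)]
        clusters.append(merged_cluster)  # same position as the merged column
    return clusters, posteriors


# Toy posteriors (made up for illustration): classes 0 and 1 overlap
# completely, class 2 is well separated.
posteriors = [
    [0.5, 0.5, 0.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.0, 1.0],
    [0.0, 0.0, 1.0],
]
clusters, merged = agglomerate(posteriors, 2)
```

In this toy input, unifying classes 0 and 1 loses no information under the assumed criterion, so they are merged first and class 2 is left on its own. The pairwise search recomputes the criterion for every candidate pair, which is adequate for a sketch but would need a cheaper incremental update for a large number of elementary classes.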

Bibliographic record

Source
Machine Learning and Data Mining in Pattern Recognition (MLDM 2007); 18-20 July 2007; Leipzig (DE) | 2007 | pp. 233-247 | 15 pages
Conference location: Leipzig (DE)
Authors
Jiri Grim; Jan Hora
Author affiliations

Institute of Information Theory and Automation of the Czech Academy of Sciences, P.O. BOX 18, 18208 Prague 8, Czech Republic;

Faculty of Nuclear Science and Physical Engineering Czech Technical University, Trojanova 13, CZ-120 00 Prague 2, Czech Republic;

Original format: PDF
Language: English (eng)
Chinese Library Classification: Computer applications

Similar literature

1. Weighted Delta Factor Cluster Ensemble Algorithm for Categorical Data Clustering in Data Mining [J]. Sengottaian Sarumathi, Natesan Shanthi, Mathivanan Sharmila. The International Arab Journal of Information Technology. 2017, No. 3
2. Analysis of ordered categorical data using expected loss minimization [J]. M. Hakimi Asiabar, S. M. T. Fatemi Ghomi. Quality Control and Applied Statistics. 2007, No. 1
3. Analysis of Ordered Categorical Data Using Expected Loss Minimization [J]. M. Hakimi Asiabar, S. M. T. Fatemi Ghomi. Quality Engineering. 2006, No. 2
4. Minimum Information Loss Cluster Analysis for Categorical Data [C]. Jiri Grim, Jan Hora. Machine Learning and Data Mining in Pattern Recognition (MLDM 2007). 2007
5. Automatic categorical data clustering and spatial data clustering by consecutive resolution refinement [D]. Foss, Andrew Philip Ogilvie. 2002
6. Evaluation of Modified Categorical Data Fuzzy Clustering Algorithm on the Wisconsin Breast Cancer Dataset [O]. Amir Ahmad. 2016
7. Comparing of EA K-modes clustering and NBEA K-modes clustering, a new method for clustering categorical data applying them on the injecting drug users data set [O]. Zamani Nasab Zahra. 2017
