Towards a Simple Clustering Criterion Based on Minimum Length Encoding


Abstract

We propose a simple and intuitive clustering evaluation criterion based on the minimum description length principle which yields a particularly simple way of describing and encoding a set of examples. The basic idea is to view a clustering as a restriction of the attribute domains, given an example's cluster membership. As a special operational case we develop the so-called rectangular uniform message length measure that can be used to evaluate clusterings described as sets of hyper-rectangles. We theoretically prove that this measure punishes cluster boundaries in regions of uniform instance distribution (i.e., unintuitive clusterings), and we experimentally compare a simple clustering algorithm using this measure with the well-known algorithms KMeans and AutoClass.
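The encoding idea in the abstract can be illustrated with a minimal sketch. This is not the paper's rectangular uniform message length formula, which the abstract does not state. It assumes a simplified cost: each example pays for its cluster index plus, per attribute, the bits needed to locate a value inside its cluster's interval at a fixed precision, and each hyper-rectangle pays for its boundaries. The function name `message_length`, the `precision` parameter, and the uniform codes are all assumptions made for illustration.

```python
import math


def message_length(points, labels, boxes, domain, precision=0.01):
    """Total bits to encode `points` under a hyper-rectangle clustering.

    points: list of coordinate tuples
    labels: cluster index for each point
    boxes:  per cluster, a list of (low, high) intervals, one per attribute
    domain: (low, high) interval of each unrestricted attribute domain
    Simplified illustrative cost, not the measure defined in the paper.
    """
    k = len(boxes)
    bits = 0.0
    # Model cost: two boundaries per attribute and cluster, each coded
    # uniformly over the full attribute domain.
    for box in boxes:
        for _, (dlo, dhi) in zip(box, domain):
            bits += 2 * math.log2((dhi - dlo) / precision)
    # Data cost: cluster index, then each value coded uniformly within the
    # interval that the cluster restricts the attribute to.
    for point, label in zip(points, labels):
        bits += math.log2(k)
        for value, (lo, hi) in zip(point, boxes[label]):
            if not lo <= value <= hi:
                raise ValueError("point lies outside its cluster's box")
            bits += math.log2(max(hi - lo, precision) / precision)
    return bits


domain = [(0.0, 1.0)]

# Uniformly spread 1-D examples: one box versus a split at 0.5.
uniform = [(i / 100,) for i in range(100)]
single_uniform = message_length(uniform, [0] * 100, [[(0.0, 1.0)]], domain)
split_uniform = message_length(
    uniform,
    [0 if x[0] < 0.5 else 1 for x in uniform],
    [[(0.0, 0.5)], [(0.5, 1.0)]],
    domain,
)

# Two well-separated clumps: one box versus one box per clump.
clumped = [(i / 500,) for i in range(50)] + [(0.9 + i / 500,) for i in range(50)]
single_clumped = message_length(clumped, [0] * 100, [[(0.0, 1.0)]], domain)
split_clumped = message_length(
    clumped, [0] * 50 + [1] * 50, [[(0.0, 0.1)], [(0.9, 1.0)]], domain
)

print(single_uniform, split_uniform)
print(single_clumped, split_clumped)
```

Under these assumptions, splitting the uniform data halves each interval but costs one bit per example for the cluster index, so only the extra boundary cost remains and the split is penalized. For the clumped data the tighter intervals outweigh both costs, so the two-cluster description is shorter.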

Bibliographic details

Source
13th European Conference on Machine Learning, Aug 19-23, 2002, Helsinki, Finland | 2002 | pp. 258-269 | 12 pages
Conference location: Helsinki (FI)
Authors
Marcus-Christopher Ludl; Gerhard Widmer
Author affiliations

Austrian Research Institute for Artificial Intelligence, Vienna University of Vienna, Austria;
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Similar literature

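1. Gene Data Clustering Based on Minimum Coding Length (in English) [J]. Wang Xuehong (汪雪红), Jiao Qingju (焦清局), Chang Panpan (常盼盼). Agricultural Science & Technology (English edition). 2012, No. 6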
2. Learning a mixture model for clustering with the completed likelihood minimum message length criterion [J]. Hong Zeng, Yiu-ming Cheung. Pattern Recognition: The Journal of the Pattern Recognition Society. 2014, No. 5
3. SAR Image Segmentation Based on Closeness Degree Cut and Minimum Description Length Criterion [J]. Zhao Wei, Tian Zheng, Yang Lijuan. Journal of the Indian Society of Remote Sensing. 2015, No. 2
4. Principal Component Analysis and Minimum Description Length Criterion Based on Through-Wall Image Enhancement [J]. Muhammad Mohsin Riaz, Abdul Ghafoor. Chinese Physics Letters (English edition). 2013, No. 4
5. Towards a Simple Clustering Criterion Based on Minimum Length Encoding [C]. Marcus-Christopher Ludl, Gerhard Widmer. European Conference on Machine Learning. 2002
6. Empirical comparison of graph-based data-mining systems based on minimum description length [D]. Shah, Rohan Y. 2004
7. Decreasing the minimum length criterion for an episode of hypomania: evaluation using self-reported data from patients with bipolar disorder [O]. Michael Bauer, Tasha Glenn, Natalie Rasgon
8. Towards a Minimum Description Length Based Stopping Criterion for Semi-Supervised Time Series Classification [O]. Nurjahan Begum, Bing Hu, Thanawin Rakthanmanon. 2014
