首页> 外文会议>European Conference on Machine Learning >Towards a Simple Clustering Criterion Based on Minimum Length Encoding
【24h】

Towards a Simple Clustering Criterion Based on Minimum Length Encoding

机译:朝着基于最小长度编码的简单聚类标准

获取原文

摘要

We propose a simple and intuitive clustering evaluation criterion based on the minimum description length principle which yields a particularly simple way of describing and encoding a set of examples. The basic idea is to view a clustering as a restriction of the attribute domains, given an example's cluster membership. As a special operational case we develop the so-called rectangular uniform message length measure that can be used to evaluate clusterings described as sets of typer-rectangles. We theoretically prove that this measure punishes cluster boundaries in regions of uniform instance distribution (i.e., unintuitive clusterings) and we experimentally compare a simple clustering algorithm using this measure with the well-known algorithms KMeans and AutoClass.
机译:我们提出了一种基于最小描述长度原理的简单且直观的聚类评估标准,其产生了一种特别简单的描述和编码一组示例的方法。考虑到示例的群集成员资格,基本思想是将群集视为属性域的限制。作为一个特殊的操作案例,我们开发所谓的矩形统一消息长度测量,可用于评估为矩形矩形组的集群。理论上,我们证明这项措施惩罚统一实例分发区域的集群边界(即,不完整的群集),我们通过通过众所周知的算法和Aut​​oclass进行了使用该措施进行了实验比较了一个简单的聚类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号