首页> 外文会议>European conference on principles of data mining and knowledge discovery >Efficient Shared Near Neighbours Clustering of Large Metric Data Sets
【24h】

Efficient Shared Near Neighbours Clustering of Large Metric Data Sets

机译:高效共享邻居大型度量数据集的聚类

获取原文

摘要

Very few clustering methods are capable of clustering data without assuming the availability of operations which are defined only in strongly structured spaces, such as vector spaces. We propose an efficient data clustering method based on the shared near neighbours approach, which requires only a distance definition and is capable of discovering clusters of any shape. Using efficient data structures for querying metric data and a scheme for partitioning and sampling the data, the method can cluster effectively and efficiently data sets whose size exceeds the internal memory size.
机译:很少有群集方法能够在不假设仅在强大结构空间(例如矢量空间)中定义的操作的可用性而不存在群集数据。我们提出了一种基于邻居方法的共享的有效数据聚类方法,该方法仅需要距离定义并且能够发现任何形状的集群。使用高效的数据结构来查询度量数据和用于分区和采样数据的方案,该方法可以有效地群集,并且有效地数据集其大小超过内部存储器大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号