首页> 外文会议>2016 International Conference on Next Generation Intelligent Systems >Imparting clusters from the tree of cores using graph based clustering
【24h】

Imparting clusters from the tree of cores using graph based clustering

机译:使用基于图的群集从核心树中植入群集

获取原文
获取原文并翻译 | 示例

摘要

Data clustering is a data analyzing technique that groups data based on the similarity. The similarities between the objects in the same group are high when data are well clustered and the similarities between objects in different groups are low. The data clustering technique is widely used in different areas such as bioinformatics, image segmentation and market research. All of the well-known clustering algorithms require input parameters which are hard to determine but have a significant influence on the clustering result. There are many methods to find arbitrary shaped clusters. DBSCAN(Density Based Spatial Clustering with Noise) is an example of clustering algorithms. This method is able to detect communities of arbitrary size and shape. It is also used to identify hubs and outliers. However, it needs user to specify a minimum similarity ε and a minimum cluster size μ to define clusters, and is sensitive to the parameter ε which is hard to determine. To overcome this method, proposed a novel density based network clustering method called graph-based clustering (gClu). The main objective of the proposed system is to form clusters from a core connected tree with maximum accuracy and also to detect hubs and outliers.
机译:数据聚类是一种基于相似性对数据进行分组的数据分析技术。当数据很好地聚类时,同一组中的对象之间的相似性较高,而不同组中的对象之间的相似性较低。数据聚类技术广泛应用于生物信息学,图像分割和市场研究等不同领域。所有众所周知的聚类算法都需要难以确定的输入参数,但对聚类结果有重大影响。找到任意形状的簇的方法很多。 DBSCAN(带噪声的基于密度的空间聚类)是聚类算法的一个示例。此方法能够检测任意大小和形状的社区。它还用于识别中心和离群值。但是,需要用户指定最小相似度ε和最小聚类大小μ来定义聚类,并且对难以确定的参数ε敏感。为了克服这种方法,提出了一种新颖的基于密度的网络聚类方法,称为基于图的聚类(gClu)。提出的系统的主要目标是从核心连接树中以最大的精度形成聚类,并检测中心和离群值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号