...
首页> 外文期刊>Cluster computing >A novel DBSCAN with entropy and probability for mixed data
【24h】

A novel DBSCAN with entropy and probability for mixed data

机译:一种新的DBSCAN,具有混合数据的熵和概率

获取原文
获取原文并翻译 | 示例
           

摘要

In big data situation, to detect clusters of different size and shape is a challenging and imperative task. Density based clustering approaches have been widely used in many areas of science due to its simplicity and the ability to detect clusters of different sizes and shapes over the last several years. With diverse conversion on categorical data, a modified version of the DBSCAN algorithm is proposed to cluster mixed data, noted as density based clustering algorithm for mixed data with integration of entropy and probability distribution (EPDCA). Optional and various conversions are provided for clustering process with adaptability. Some benchmark data sets from UCI have been selected for testing the capability and validity of EPDCA. It was shown that the clustering results of EPDCA are considerably improved, especially on automatically number of clusters formed, noise discovery and time elapsed to form clusters.
机译:在大数据情况下,要检测不同尺寸和形状的集群是一个具有挑战性和势不一准的任务。 由于其简单性和在过去几年中检测不同尺寸和形状的群集,密度基于聚类的聚类方法已被广泛应用于许多科学领域。 随着对分类数据的不同转换,提出了一种DBSCAN算法的修改版本,以集群混合数据,指出为具有熵和概率分布的集成(EPDCA)的混合数据的密度基于集群算法。 可选和各种转换,用于具有适应性的聚类过程。 已选择来自UCI的一些基准数据集以测试EPDCA的功能和有效性。 结果表明,EPDCA的聚类结果显着改善,特别是在形成形成的簇数,噪声发现和形成簇的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号