...
首页> 外文期刊>International journal of information technology and web engineering >A Novel Scalable Signature Based Subspace Clustering Approach for Big Data
【24h】

A Novel Scalable Signature Based Subspace Clustering Approach for Big Data

机译:基于大数据的新型可扩展签名的子空间聚类方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

“Big data” as the name suggests is a collection of large and complicated data sets which are usually hard to process with on-hand data management tools or other conventional processing applications. A scalable signature based subspace clustering approach is presented in this article that would avoid identification of redundant clusters. Various distance measures are utilized to perform experiments that validate the performance of the proposed algorithm. Also, for the same purpose of validation, the synthetic data sets that are chosen have different dimensions, and their size will be distributed when opened with Weka. The F1 quality measure and the runtime of these synthetic data sets are computed. The performance of the proposed algorithm is compared with other existing clustering algorithms such as CLIQUE.INSCY and SUNCLU.
机译:“大数据”作为姓名,建议是一个大型和复杂的数据集的集合,通常很难用手数据管理工具或其他传统处理应用程序处理。本文提出了一种可扩展的基于签名的子空间聚类方法,避免识别冗余群集。各种距离测量用于执行验证所提出的算法性能的实验。此外,出于验证的相同目的,所选的合成数据集具有不同的维度,并且随着Weka打开时,它们的大小将分布。计算F1质量测量和这些合成数据集的运行时间。将所提出的算法的性能与其他现有聚类算法进行比较,例如Clique.Inscy和Sunclu。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号