【24h】

Variance based data fusion for k-means++

机译:用于k均值的基于方差的数据融合++

获取原文
获取原文并翻译 | 示例

摘要

The need of clustering the data has been increased day by day in various applications such as Intrusion Detection System, Image Recognition System, etc. Clustering is very much useful in splitting the huge unlabeled data itemset into meaningful groups using similarity metrics. But, at the same time, the cost of the clustering algorithm is computationally expensive for such high dimensional data. Therefore, in our proposed `Variance based Data Fusion K-Means++' data (attribute values) available in multiple dimensions are fused to single (or) very few dimensions. By fusing the data, the vital characteristics of the data are perfectly retained by appropriately weighing the attributes. Proposed preprocessing method is used with K-Means++ and tested with publicly available datasets. From the experiments, it is evident that the VDF K-Means++ achieves high accuracy with fewer false alarms and less processing time than the existing algorithms.
机译:在各种应用程序(例如入侵检测系统,图像识别系统等)中,对数据进行聚类的需求日益增加。聚类在使用相似性度量将巨大的未标记数据项集分成有意义的组中非常有用。但是,与此同时,对于这种高维数据,聚类算法的成本在计算上是昂贵的。因此,在我们提出的“基于方差的数据融合K-Means ++”中,可以将多维可用的数据(属性值)融合到单个(或很少)维上。通过融合数据,可以通过适当权衡属性来完美保留数据的重要特征。拟议的预处理方法与K-Means ++一起使用,并与公开可用的数据集进行了测试。从实验中可以明显看出,与现有算法相比,VDF K-Means ++可以实现更高的准确度,并具有更少的误报和更少的处理时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号