首页> 外文期刊>Computer networks >An improved clustering algorithm and its application in IoT data analysis
【24h】

An improved clustering algorithm and its application in IoT data analysis

机译:一种改进的聚类算法及其在物联网数据分析中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

With the popularization of the Internet of Things(IoT), the data are exploding. Data analysis is foundation of IoT based applications, and clustering is an important tool for data analysis. In clustering, determining the number of clusters is an important issue, which can be either designated artificially or determined automatically. The artificial methods have many disadvantages. And the automatic methods have distinct advantages, whose critical task is to design an appropriate clusters number updating algorithm. Although many researches have been made, most of them are not effective or cannot guarantee the unique clustering results and the good clustering accuracy rate. Meanwhile, considering that IoT based applications always involved both numerical data and nonnumeric data, and treating all the nonnumeric data in the same way is unpractical, we try to further classify the nonnumeric attributes according to their natures and explore the corresponding similarity metrics respectively. Based on it, an algorithm for determining the initial clustering centers is put forward by the dissimilarities and the densities of data objects. And then, an improved clustering algorithm is designed on a revised inter-cluster entropy for mixed data. The experiments on the 3 datasets in University of California at Irvine(UCI) show that the improved clustering algorithm is a deterministic clustering algorithm with good performance. (C) 2019 Published by Elsevier B.V.
机译:随着物联网的普及(物联网),数据正在爆炸。数据分析是基于IOT的应用程序的基础,群集是数据分析的重要工具。在聚类中,确定群集的数量是一个重要问题,可以是人工或自动确定的。人工方法有许多缺点。并且自动方法具有明显的优势,其关键任务是设计适当的群集数更新算法。虽然已经进行了许多研究,但大多数都没有效果或不能保证独特的聚类结果和良好的聚类精度率。同时,考虑到基于物联网的应用程序始终涉及数值数据和非数字数据,并以相同的方式处理所有非数字数据是不可思议的,我们尝试根据其自然来进一步对非数字属性进行分类并分别探索相应的相似度指标。基于它,通过不同的不同和数据对象的密度提出了一种确定初始聚类中心的算法。然后,改进的聚类算法在修订的群集熵上设计了混合数据。 Irvine(UCI)在加利福尼亚大学(UCI)的3个数据集的实验表明,改进的聚类算法是一种具有良好性能的确定性聚类算法。 (c)2019年由elestvier b.v发布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号