首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Clustering based approach for incomplete data streams processing
【24h】

Clustering based approach for incomplete data streams processing

机译:基于集群的不完整数据流处理方法

获取原文
获取原文并翻译 | 示例
       

摘要

Recent applications such as sensor networks generate continuous and dynamic data streams. Data streams are often gathered from multiple data sources with some incompleteness. Clustering such data is constrained by incompleteness of data, data distribution, and continuous nature of data streams. Ignoring missing values in incomplete data clustering, especially in high missing rates decreases the clustering performance. Traditional clustering is applied on the whole data without dealing with data distribution. This paper presents an efficient framework called Fuzzy c-means clustering for Incomplete Data streams (FID) that works adaptively with incomplete data streams even with high missing rates. The proposed FID estimates missing values based on the corresponding nearest-neighbors' intervals. To overcome the previously mentioned data streams clustering problems, the continuous clustering mechanism is adopted and extended to accurately handle the incomplete data streams. Experimental results using two different data sets prove the efficiency of the proposed FID comparing to the alternative approaches.
机译:最近的应用程序等传感器网络产生连续和动态的数据流。数据流通常由多个数据源收集,具有一些不完整性。群集此类数据受到数据流,数据分布和数据流的连续性的不完整性的约束。忽略不完整的数据群集中的缺失值,尤其是高缺失的速率会降低聚类性能。传统的聚类应用于整个数据,而无需处理数据分发。本文介绍了一个名为Fuzzy C-Means集群的有效框架,用于不完整的数据流(FID),即使具有高缺失的速率,即使具有不完整的数据流也适用于不完整的数据流。所提出的FID基于相应的最近邻居的间隔估计缺失值。为了克服前面提到的数据流聚类问题,采用连续聚类机制并扩展以精确处理不完整的数据流。使用两种不同数据集的实验结果证明了建议FID比较与替代方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号