An Improved Online Stream Data Clustering Algorithm


Abstract

Stream data mining has been a hot research topic in recent years. To improve the efficiency of stream data mining, this paper designs an online stream data clustering algorithm, IStrAP. IStrAP takes into account the features of stream data, such as its potentially infinite volume, its rapid arrival, and the impossibility of scanning historical data repeatedly, and adds an outlier elimination method to the existing algorithm StrAP. IStrAP performs statistical analysis on the data in the reservoir (a temporary storage area) to obtain statistics and parameters that reflect the data characteristics, removes abnormal data from the reservoir according to these statistical properties, and then clusters the remaining data in the reservoir. The experimental results show that IStrAP eliminates outliers effectively, and that it not only achieves higher clustering accuracy and lower time complexity than the existing StrAP algorithm, but also adapts better to dynamic changes in the stream data.
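The abstract does not say which statistics IStrAP computes or which rule it uses to flag abnormal data. As a rough illustration only, the reservoir filtering step might be sketched as follows, assuming a simple rule: a point is treated as an outlier when its Euclidean distance to the reservoir centroid exceeds the mean distance plus `k` standard deviations. The function name `filter_reservoir`, the parameter `k`, and the sample data are hypothetical and are not taken from the paper; the subsequent clustering of the remaining points is left out.

```python
import math
import statistics


def filter_reservoir(reservoir, k=2.0):
    """Split reservoir points into (kept, removed).

    Assumed rule (not from the paper): a point is an outlier when its
    Euclidean distance to the reservoir centroid exceeds
    mean + k * std of all such distances.
    """
    if not reservoir:
        return [], []

    # Centroid of the reservoir, computed one dimension at a time.
    dims = len(reservoir[0])
    centroid = tuple(
        statistics.fmean(p[d] for p in reservoir) for d in range(dims)
    )

    # Statistics of the distances from each point to the centroid.
    distances = [math.dist(p, centroid) for p in reservoir]
    threshold = statistics.fmean(distances) + k * statistics.pstdev(distances)

    kept, removed = [], []
    for point, dist in zip(reservoir, distances):
        (kept if dist <= threshold else removed).append(point)
    return kept, removed


# Hypothetical reservoir: nine points near the unit square and one far away.
reservoir = [
    (0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.5, 0.5),
    (0.2, 0.8), (0.8, 0.2), (0.4, 0.6), (0.6, 0.4), (50.0, 50.0),
]
kept, removed = filter_reservoir(reservoir)
print(removed)  # the far-away point (50.0, 50.0) is flagged
```

Only `kept` would then be passed to the clustering step. Note that in a small reservoir a single extreme point inflates the standard deviation, so a threshold of three standard deviations may never fire; that is why this sketch defaults to `k=2.0`.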


Bibliographic information

Source: 2012 Second International Conference on Business Computing and Global Informatization | 2012 | pp. 526-529 | 4 pages
Conference location: Shanghai (CN)
Authors: Li Lingjuan; Li Xiong
Original format: PDF
Language of text: eng
CLC classification: automatic information theory
Date added to database: 2022-08-26 13:58:24

Similar literature

1. i-CODAS An Improved Online Data Stream Clustering in Arbitrary Shaped Clusters [J]. Md Kamrul Islam, Md Manjur Ahmed, Kamal Zuhairi Zamli. Engineering Letters. 2019, No. 4
2. An improved algorithm for clustering uncertain traffic data streams based on Hadoop platform [J]. Xu Weixiang, Li Jiaojiao. International Journal of Modern Physics, B. Condensed Matter Physics, Statistical Physics, Applied Physics. 2019, No. 19
3. Improved clustering algorithm based on high-speed network data stream [J]. Yin Chunyong, Xia Lian, Zhang Sun. Soft computing: A fusion of foundations, methodologies and applications. 2018, No. 13
4. A new evolving clustering algorithm for online data streams [C]. Clauber Gomes Bezerra, Bruno Sielly Jales Costa, Luiz Affonso Guedes. Institute of Electrical and Electronics Engineers Conference on Evolving and Adaptive Intelligent Systems. 2016
5. Scalable frameworks and algorithms for cluster ensembles and clustering data streams [D]. Hore, Prodip. 2007
6. An improved adaptive memetic differential evolution optimization algorithms for data clustering problems [O]. Hossam M. J. Mustafa, Masri Ayob, Mohd Zakree Ahmad Nazri. 2015
7. Optimizing Data Stream Representation: An Extensive Survey on Stream Clustering Algorithms [O]. Matthias Carnein, Heike Trautmann. 2019
