首页> 外文会议>International Neural Network Society Conference on Big Data >Parallel Computing TEDA for High Frequency Streaming Data Clustering
【24h】

Parallel Computing TEDA for High Frequency Streaming Data Clustering

机译:用于高频流数据群集的并行计算TEDA

获取原文

摘要

In this paper, a novel online clustering approach called Parallel TEDA is introduced for processing high frequency streaming data. This newly proposed approach is developed within the recently introduced TEDA theory and inherits all advantages from it. In the proposed approach, a number of data stream processors are involved, which collaborate with each other efficiently to achieve parallel computation as well as a much higher processing speed. A fusion center is involved to gather the key information from the processors which work on chunks of the whole data stream and generate the overall output. The quality of the generated clusters is being monitored within the data processors all the time and stale clusters are being removed to ensure the correctness and timeliness of the overall clustering results. This, in turn, gives the proposed approach a stronger ability of handling shifts/drifts that may take place in live data streams. The numerical experiments performed with the proposed new approach Parallel TEDA on benchmark datasets present higher performance and faster processing speed when compared with the alternative well-known approaches. The processing speed has been demonstrated to fall exponentially with more data processors involved. This new online clustering approach is very suitable and promising for real-time high frequency streaming processing and data analytics.
机译:本文介绍了一种新的在线聚类方法,用于处理高频流数据。这种新提出的方法是在最近引入的TEDA理论中开发的,并继承了它的所有优势。在所提出的方法中,涉及许多数据流处理器,其有效地彼此协作以实现并行计算以及更高的处理速度。融合中心涉及从处理器中收集关键信息,该处处理整个数据流的块并生成整体输出。所有时间和陈旧的集群都会在数据处理器内监视所生成的集群的质量,以确保整体聚类结果的正确性和及时性。这反过来,这使得提出的方法能够在实时数据流中处理可能发生的换档/漂移的能力。与替代着名的方法相比,在基准数据集中,在基准数据集上进行了在基准数据集上进行了更高的性能和更快的处理速度的数值实验。已经证明了处理速度以符合更多数据处理器呈指数逐渐下降。这种新的在线聚类方法非常适合和有希望的实时高频流处理和数据分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号