首页> 外文会议>IEEE international conference on data engineering >Incremental cluster evolution tracking from highly dynamic network data
【24h】

Incremental cluster evolution tracking from highly dynamic network data

机译:从高度动态的网络数据进行增量集群演化跟踪

获取原文

摘要

Dynamic networks are commonly found in the current web age. In scenarios like social networks and social media, dynamic networks are noisy, are of large-scale and evolve quickly. In this paper, we focus on the cluster evolution tracking problem on highly dynamic networks, with clear application to event evolution tracking. There are several previous works on data stream clustering using a node-by-node approach for maintaining clusters. However, handling of bulk updates, i.e., a subgraph at a time, is critical for achieving acceptable performance over very large highly dynamic networks. We propose a subgraph-by-subgraph incremental tracking framework for cluster evolution in this paper. To effectively illustrate the techniques in our framework, we consider the event evolution tracking task in social streams as an application, where a social stream and an event are modeled as a dynamic post network and a dynamic cluster respectively. By monitoring through a fading time window, we introduce a skeletal graph to summarize the information in the dynamic network, and formalize cluster evolution patterns using a group of primitive evolution operations and their algebra. Two incremental computation algorithms are developed to maintain clusters and track evolution patterns as time rolls on and the network evolves. Our detailed experimental evaluation on large Twitter datasets demonstrates that our framework can effectively track the complete set of cluster evolution patterns from highly dynamic networks on the fly.
机译:动态网络通常在当前网络时代找到。在诸如社交网络和社交媒体之类的场景中,动态网络嘈杂,规模庞大且发展迅速。在本文中,我们将重点放在高动态网络上的集群演化跟踪问题上,并将其清楚地应用于事件演化跟踪。以前有一些关于数据流群集的工作,这些数据流使用逐节点方法维护群集。但是,批量更新(即一次子图)的处理对于在超大型高度动态网络上获得可接受的性能至关重要。本文提出了一种基于子图的增量跟踪框架,用于集群演化。为了有效地说明我们框架中的技术,我们将社交流中的事件演化跟踪任务视为一个应用程序,其中社交流和事件分别建模为动态发布网络和动态集群。通过在衰落的时间窗口中进行监视,我们引入了一个骨架图来总结动态网络中的信息,并使用一组原始的演化运算及其代数来形式化集群演化模式。随着时间的流逝和网络的发展,开发了两种增量计算算法来维护集群并跟踪发展模式。我们对大型Twitter数据集的详细实验评估表明,我们的框架可以有效地实时跟踪来自高度动态网络的完整集群演化模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号