首页> 外文会议>IEEE international conference on data engineering >Hierarchical in-network attribute compression via importance sampling
【24h】

Hierarchical in-network attribute compression via importance sampling

机译:通过Importance采样分层网络内属性压缩

获取原文
获取外文期刊封面目录资料

摘要

Many real-world complex systems can be modeled as dynamic networks with real-valued vertex/edge attributes. Examples include users' opinions in social networks and average speeds in a road system. When managing these large dynamic networks, compressing attribute values becomes a key requirement, since it enables the answering of attribute-based queries regarding a node/edge or network region based on a compact representation of the data. To address this problem, we introduce a lossy network compression scheme called Slice Tree (ST), which partitions a network into smooth regions with respect to node/edge values and compresses each value as the average of its region. ST applies a compact representation for network partitions, called slices, that are defined as a center node and radius distance. We propose an importance sampling algorithm to efficiently prune the search space of candidate slices in the ST construction by biasing the sampling process towards the node values that most affect the compression error. The effectiveness of ST in terms of compression error, compression rate, and running time is demonstrated using synthetic and real datasets. ST scales to million-node instances and removes up to 87% of the error in attribute values with a 10 compression ratio. We also illustrate how ST captures relevant phenomena in real networks, such as research collaboration patterns and traffic congestions.
机译:许多现实世界复杂系统可以被建模为具有实值顶点/边缘属性的动态网络。示例包括用户在社交网络中的意见和道路系统的平均速度。当管理这些大动态网络时,压缩属性值成为关键要求,因为它能够基于数据的紧凑表示来支持关于节点/边缘或网络区域的基于属性的查询。为了解决这个问题,我们介绍了一个称为切片树(ST)的有损网络压缩方案,其将网络分区为具有节点/边缘值的平滑区域,并将每个值压缩为其区域的平均值。 ST应用于网络分区的紧凑型表示,称为切片,定义为中心节点和半径距离。我们提出了一种重要的采样算法,通过将采样过程朝向最大影响压缩误差的节点值偏置采样过程,有效地修剪ST结构中的候选切片的搜索空间。使用合成和真实数据集对ST在压缩误差,压缩率和运行时间方面的有效性。 ST缩放到百万节点实例,并以10个压缩比删除最多87%的属性值中的错误。我们还说明了ST如何在真实网络中捕获相关现象,例如研究协作模式和交通拥堵。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号