首页> 外文会议>ACM SIGKDD international conference on knowledge discovery and data mining;KDD 10 >Metric Forensics: A Multi-Level Approach for Mining Volatile Graphs
【24h】

Metric Forensics: A Multi-Level Approach for Mining Volatile Graphs

机译:度量取证:挖掘易失图的多层次方法

获取原文
获取外文期刊封面目录资料

摘要

Advances in data collection and storage capacity have made it increasingly possible to collect highly volatile graph data for analysis. Existing graph analysis techniques are not appropriate for such data, especially in cases where streaming or near-real-time results are required. An example that has drawn significant research interest is the cyber-security domain, where internet communication traces are collected and real-time discovery of events, behaviors, patterns, and anomalies is desired. We propose MetricForensics, a scalable framework for analysis of volatile graphs. MetricForen-SICS combines a multi-level "drill down" approach, a collection of user-selected graph metrics, and a collection of analysis techniques. At each successive level, more sophisticated metrics are computed and the graph is viewed at finer temporal resolutions. In this way, METRICFORENSICS scales to highly volatile graphs by only allocating resources for computationally expensive analysis when an interesting event is discovered at a coarser resolution first. We test METRICFORENSICS on three real-world graphs: an enterprise IP trace, a trace of legitimate and malicious network traffic from a research institution, and the MIT Reality Mining proximity sensor data. Our largest graph has ~3M vertices and ~32M edges, spanning 4.5 days. The results demonstrate the scalability and capability of METRICFORENSICS in analyzing volatile graphs; and highlight four novel phenomena in such graphs: elbows, broken correlations, prolonged spikes, and lightweight stars.
机译:数据收集和存储容量的进步使得收集高度易变的图形数据进行分析的可能性越来越大。现有的图形分析技术不适用于此类数据,尤其是在需要流式传输或接近实时的结果的情况下。一个引起广泛研究兴趣的例子是网络安全领域,该领域收集了互联网通信踪迹,并且需要实时发现事件,行为,模式和异常。我们提出MetricForensics,这是一种用于分析易失性图形的可扩展框架。 MetricForen-SICS结合了多级“向下钻取”方法,用户选择的图形指标的集合以及分析技术的集合。在每个连续级别上,都会计算更复杂的指标,并以更精细的时间分辨率查看图形。通过这种方式,当首先以较粗略的分辨率发现有趣的事件时,METRICFORENSICS仅通过分配资源进行计算量大的分析即可缩放到高度易变的图。我们在三个真实的图形上测试METRICFORENSICS:企业IP跟踪,来自研究机构的合法和恶意网络流量跟踪以及MIT Reality Mining接近传感器数据。我们最大的图形具有约300万个顶点和约3200万条边,跨度为4.5天。结果表明,METRICFORENSICS在分析易失性图时具有可扩展性和功能。并在此类图中突出显示了四种新颖现象:肘部,相关性破坏,尖峰延长和轻型恒星。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号