首页> 外文会议>2010 22nd International Teletraffic Congress >Inferring applications at the network layer using collective traffic statistics
【24h】

Inferring applications at the network layer using collective traffic statistics

机译:使用集体流量统计信息推断网络层上的应用程序

获取原文

摘要

Operating, managing and securing networks require a thorough understanding of the demands placed on the network by the endpoints it interconnects, the characteristics of the traffic the endpoints generate, and the distribution of that traffic over the resources of the network infrastructure. A major differentiator in the types of resource required by traffic is the class of endpoint application that generates it. Service providers determine the application mix present in traffic via measurements, e.g., flow measurements furnished by routers. Previous work has shown that a fairly accurate determination of application type can be made from this data. However, protocol level information, such as TCP/UDP ports and other parts of the transport header, and also parts of the network header in some cases, may not be accessible due to the use of encryption or tunneling protocols by endpoints or gateways. Furthermore, the utility of ports as signifiers of application type has some limitations due to abuse and non-standard usage, amongst other reasons. These factors reduce the classification accuracy. In this paper, we propose a novel technique for inferring the distribution of application classes present in the aggregated traffic flows between endpoints, that exploits both the measured statistics of the traffic flows, and the spatial distribution of those flows across the network. Our method employs a two-step supervised model, where the bootstrapping step provides initial (inaccurate) inference on the traffic application classes, and the graph-based calibration step adjusts the initial inference through the collective spatial traffic distribution. In evaluations using real traffic flow measurements from a large ISP, we show how our method can accurately classify application types within aggregate traffic between endpoints, even without knowledge of ports and other traffic features. While the bootstrap estimate classifies the aggregates with 80% accuracy, incorporating spatia--l distributions through calibration increases the accuracy to 92%, i.e., roughly halving the number of errors.
机译:运行,管理和保护网络需要透彻了解互连的端点对网络提出的要求,端点生成的流量的特性以及该流量在网络基础结构资源上的分布。流量所需的资源类型的主要区别在于生成它的端点应用程序的类别。服务提供商通过诸如路由器提供的流量测量之类的测量来确定流量中存在的应用混合。先前的工作表明,可以从此数据中相当准确地确定应用程序类型。但是,由于端点或网关使用加密或隧道协议,协议级别信息(例如TCP / UDP端口和传输头的其他部分,以及某些情况下的网络头的某些部分)可能无法访问。此外,端口的用途作为应用程序类型的指示符还有其他一些原因,这些原因是由于滥用和非标准用法引起的。这些因素降低了分类准确性。在本文中,我们提出了一种新技术来推断存在于端点之间的聚合流量中的应用程序类别的分布,该技术既利用流量的测量统计数据,又利用这些流量在网络中的空间分布。我们的方法采用了两步监督模型,其中自举步骤提供了有关交通应用类别的初始(不准确)推断,而基于图的校准步骤则通过集合空间交通分布调整了初始推断。在使用来自大型ISP的真实流量测量结果进行的评估中,我们展示了即使在不了解端口和其他流量功能的情况下,我们的方法也可以在端点之间的聚合流量中准确地对应用程序类型进行分类。自举估算值对聚合的分类准确度为80%,同时结合了Spatia- -- 通过校准进行的分布将精度提高到92%,即将错误数量大致减少了一半。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号