...
首页> 外文期刊>IEICE Transactions on Communications >Cluster Analysis Of Internet Users Based On Hourly Traffic Utilization
【24h】

Cluster Analysis Of Internet Users Based On Hourly Traffic Utilization

机译:基于每小时流量利用的互联网用户聚类分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Internet access traffic follows hourly patterns that depend on various factors, such as the periods users stay on-line at the access point (e.g. at home or in the office) or their preferences for applications. The clustering of Internet users may provide important information for traffic engineering and billing. For example, it can be used to set up service differentiation according to hourly behavior, resource optimization based on multi-hour routing and definition of tariffs that promote Internet access in low busy hours. In this work, we propose a methodology for clustering Internet users with similar patterns of Internet utilization, according to their hourly traffic utilization. The methodology resorts to three statistical multi-variate analysis techniques: cluster analysis, principal component analysis and discriminant analysis. The methodology is illustrated through measured data from two distinct ISPs, one using a CATV access network and the other an ADSL one, offering distinct traffic contracts. Principal component analysis is used as an exploratory tool. Cluster analysis is used to identify the relevant Internet usage profiles, with the partitioning around medoids and Ward's method being the preferred clustering methods. For the two data sets, these methods lead to the choice of 3 clusters with different hourly traffic utilization profiles. The cluster structure is validated through discriminant analysis. It is also evaluated in terms of several characteristics of the user traffic not used in the cluster analysis, such as the type of applications, the amount of downloaded traffic, the activity duration and the transfer rate, resulting in coherent outcomes.
机译:互联网访问流量遵循每小时模式,具体取决于各种因素,例如用户在接入点(例如在家或办公室)在线停留的时间段或他们对应用程序的偏好。 Internet用户的群集可能为流量工程和计费提供重要信息。例如,它可用于根据小时行为,基于多小时路由的资源优化以及可在繁忙时段促进Internet访问的费率定义来设置服务差异。在这项工作中,我们提出了一种根据小时流量利用率将具有相似互联网利用率模式的互联网用户聚类的方法。该方法采用三种统计多元分析技术:聚类分析,主成分分析和判别分析。通过从两个不同的ISP(一个使用CATV接入网,另一个使用ADSL,提供不同的流量合同)的测量数据来说明该方法。主成分分析用作探索工具。聚类分析用于识别相关的Internet使用情况,其中以类固醇为中心进行分区,而Ward方法是首选的聚类方法。对于这两个数据集,这些方法导致选择3个具有不同小时流量利用率配置文件的群集。聚类结构通过判别分析得到验证。还根据聚类分析中未使用的用户流量的几个特征来评估它,例如应用程序的类型,下载的流量量,活动持续时间和传输速率,从而产生一致的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号