Detecting Anomalies from End-to-End Internet Performance Measurements (PingER) Using Cluster Based Local Outlier Factor

机译：使用基于群集的本地离群因素从端到端Internet性能度量（PingER）检测异常

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

PingER (Ping End-to-End Reporting) is a worldwide end-to-end Internet performance measurement framework. It was developed by the SLAC National Accelerator Laboratory, Stanford, USA and running from the last 20 years. It has more than 700 monitoring agents and remote sites which monitor the performance of Internet links around 170 countries of the world. At present, the size of the compressed PingER data set is about 60 GB comprising of 100,000 flat files. The data is publicly available for valuable Internet performance analyses. However, the data sets suffer from missing values and anomalies due to congestion, bottleneck links, queuing overflow, network software misconfiguration, hardware failure, cable cuts, and social upheavals. Therefore, the objective of this paper is to detect such performance drops or spikes labeled as anomalies or outliers for the PingER data set. In the proposed approach, the raw text files of the data set are transformed into a PingER dimensional model. The missing values are imputed using the k-NN algorithm. The data is partitioned into similar instances using the k-means clustering algorithm. Afterward, clustering is integrated with the Local Outlier Factor (LOF) using the Cluster Based Local Outlier Factor (CBLOF) algorithm to detect the anomalies or outliers from the PingER data. Finally, anomalies are further analyzed to identify the time frame and location of the hosts generating the major percentage of the anomalies in the PingER data set ranging from 1998 to 2016.

机译：PingER（Ping端到端报告）是一个全球性的端到端Internet性能评估框架。它是由美国斯坦福的SLAC国家加速器实验室开发的，从过去的20年开始运行。它拥有700多个监视代理程序和远程站点，它们监视着世界170个国家/地区的Internet链接的性能。目前，压缩的PingER数据集的大小约为60 GB，包含100,000个平面文件。该数据可公开获得，以进行有价值的Internet性能分析。但是，由于拥塞，瓶颈链路，排队溢出，网络软件配置错误，硬件故障，电缆切断和社会动荡，数据集遭受缺失值和异常的困扰。因此，本文的目的是检测PingER数据集的此类性能下降或峰值，标记为异常或离群值。在提出的方法中，将数据集的原始文本文件转换为PingER维度模型。使用k-NN算法估算缺失值。使用k-均值聚类算法将数据划分为相似的实例。然后，使用基于聚类的局部离群因子（CBLOF）算法将聚类与局部离群因子（LOF）集成在一起，以从PingER数据中检测异常或离群值。最后，对异常进行进一步分析，以识别在1998年至2016年PingER数据集中产生异常主要百分比的主机的时间范围和位置。

著录项

来源
《15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications》|2017年|982-989|共8页
会议地点 Guangzhou(CN)
作者
Saqib Ali; Guojun Wang; Roger L. Cottrell; Tayyba Anwar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Internet; Earthquakes; Monitoring; Loss measurement; Hardware; Clustering algorithms;

机译：互联网;地震;监测;损耗测量;硬件;聚类算法;;

相似文献

外文文献
中文文献
专利

1. ADSTREAM: Anomaly Detection in Large-Scale Data Streams Using Local Outlier Factor Based on Micro-Cluster [J] . Advanced Science Letters . 2017,第10期

机译：adstream：使用基于微簇的本地异常因素的大规模数据流中的异常检测
2. Application of Local Outlier Factor Algorithm to Detect Anomalies in Computer Network [J] . Auskalnis Juozas, Paulauskas Nerijus, Baskys Algirdas Elektronika ir Elektrotechnika . 2018,第3期

机译：本地异常因素因子算法在计算机网络中检测异常的应用
3. Detecting Outlier Measurements Based on Graph Rigidity for Wireless Sensor Network Localization [J] . Yang Z., Wu C., Chen T., Vehicular Technology, IEEE Transactions on . 2013,第1期

机译：基于图刚度的离群值检测用于无线传感器网络定位
4. Detecting Anomalies from End-to-End Internet Performance Measurements (PingER) Using Cluster Based Local Outlier Factor [C] . Saqib Ali, Guojun Wang, Roger L. Cottrell, IEEE International Symposium on Parallel and Distributed Processing with Applications . 2017

机译：使用基于集群的本地异常因素因素检测端到端互联网性能测量（Pinger）的异常
5. End-to-end performance measurements for overlay flow engineering in the Internet [D] . Mohamed, Salim Ammir B. 2012

机译：互联网覆盖流工程的端到端性能测量
6. End-to-end performance measurement of Internet based medical applications. [O] . P. Dev, D. Harris, D. Gutierrez, 2002

机译：基于Internet的医疗应用程序的端到端性能度量。
7. Detecting outlier measurements based on graph rigidity for wireless sensor network localization [O] . Zheng Yang, Chenshu Wu, Student Member, 2013

机译：基于图形刚度检测离群测量，用于无线传感器网络定位
8. Use of Mahalanobis Distance for Detecting Outliers and Outlier Clusters in Markedly Non-Normal Data: A Vehicular Traffic Example [R] . Warren, R., Smith, R. F., Cybenko, A. K. 2011

机译：使用马哈拉诺比斯距离检测显着非正态数据中的异常值和异常值群集：车载流量示例

Detecting Anomalies from End-to-End Internet Performance Measurements (PingER) Using Cluster Based Local Outlier Factor

摘要

著录项

相似文献

相关主题

期刊订阅