首页> 外文会议>International Conference on Digital Signal Processing >Distance-based k-nearest neighbors outlier detection method in large-scale traffic data
【24h】

Distance-based k-nearest neighbors outlier detection method in large-scale traffic data

机译:大规模交通数据中基于距离的k近邻离群值检测方法

获取原文

摘要

This paper presents a k-nearest neighbors (kNN) method to detect outliers in large-scale traffic data collected daily in every modern city. Outliers include hardware and data errors as well as abnormal traffic behaviors. The proposed kNN method detects outliers by exploiting the relationship among neighborhoods in data points. The farther a data point is beyond its neighbors, the more possible the data is an outlier. Traffic data here was recorded in a video format, and converted to spatial-temporal (ST) traffic signals by statistics. The ST signals are then transformed to a two-dimensional (2D) (x, y) -coordinate plane by Principal Component Analysis (PCA) for dimension reduction. The distance-based kNN method is evaluated by unsupervised and semi-supervised approaches. The semi-supervised approach reaches 96.19% accuracy.
机译:本文提出了一种k近邻法(kNN),用于检测每个现代城市每天收集的大规模交通数据中的离群值。离群值包括硬件和数据错误以及异常的流量行为。提出的kNN方法通过利用数据点中邻域之间的关系来检测离群值。数据点离其邻居越远,则数据越可能成为异常值。这里的交通数据以视频格式记录,并通过统计转换为时空(ST)交通信号。然后,通过主成分分析(PCA)将ST信号转换为二维(2D)(x,y)坐标平面以进行尺寸减小。基于距离的kNN方法通过无监督和半监督方法进行评估。半监督方法的准确率达到96.19%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号