首页> 外文会议>IEEE International Congress on Big Data >Top (k1, k2) Distance-based outliers detection in an uncertain dataset
【24h】

Top (k1, k2) Distance-based outliers detection in an uncertain dataset

机译:不确定数据集中的基于(k1,k2)的基于距离的离群值检测

获取原文

摘要

In this paper, we focus on distance-based outliers detection in an uncertain dataset, which is very useful in large social network. Based on the x-tuple model and the possible world semantics, we propose the concept of tuple outlier score, top k probability and top (k1, k2) distance-based outlier. We then design an algorithm using dynamic programming technique to calculate tuple outlier scores and detect top (k1, k2) distance-based outliers. The local neighbor region is proposed to detect approximate outliers with high precision efficiently. We also propose two pruning strategies to avoid additional computation overhead and prune data objects that cannot be outliers. After theory analysis, we conduct experiments in two real datasets to verify good performance of our method.
机译:在本文中,我们专注于在不确定的数据集中的距离的异常值检测,这在大型社交网络中非常有用。基于X组模型和可能的世界语义,我们提出了元组异常分数的概念,顶部K概率和顶部(K1,K2)距离的异常值。然后,我们使用动态编程技术设计一种算法来计算元组异常分数,并检测基于距离的异常值的顶部(K1,K2)。建议局部邻居区域以有效地检测高精度的近似异常值。我们还提出了两个修剪的策略,以避免额外的计算开销和剪枝数据对象,不能是异常值。在理论分析之后,我们在两个真实数据集中进行实验,以验证我们的方法的良好性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号