Neighborhood relevant outlier detection approach based on information entropy

Yu Qingying; Luo Yonglong; Chen Chuanming; Bian Weixin


Abstract

Outlier detection is an interesting issue in data mining and machine learning. In this paper, to detect outliers, an information-entropy-based k-nearest neighborhood relevant outlier factor algorithm is proposed that is combined with Shannon information theory and the triangle pruning strategy. The algorithm accounts for the data points whose k-nearest neighbors are distributed on the edge of the range within the designated radius. In particular, the neighborhood influence on each point is considered to address the problem of information concealment and submergence. Information entropy is used to calculate the weights to distinguish the importance of each attribute. Then, based on the attribute weights, the improved pruning strategy reduces the computational complexity of the subsequent procedures by removing some inliers and obtaining the outlier candidate dataset. Finally, according to the weighted distance between the objects in the candidate dataset and those in the original dataset, the algorithm calculates the dissimilarity between each object and its k-nearest neighbors. The data points with the top r dissimilarity are regarded as the outliers. Experimental results show that, compared
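The pipeline outlined in the abstract (entropy-derived attribute weights, a weighted distance, a k-nearest-neighbor dissimilarity score, and selection of the top r points) can be illustrated with a minimal Python sketch. It is not the authors' kNNROF algorithm: the abstract does not give the formulas, so the equal-width binning, the rule that higher-entropy attributes get larger weights, the mean-distance score, and all function names (`entropy_weights`, `knn_dissimilarity`, `top_r_outliers`) are assumptions made for illustration. The triangle pruning step is omitted, so every point is scored.

```python
import math
from collections import Counter


def entropy_weights(data, bins=4):
    """Assumed scheme: weight each attribute by the Shannon entropy of its
    equal-width histogram, normalized so the weights sum to 1."""
    n, dims = len(data), len(data[0])
    entropies = []
    for j in range(dims):
        col = [row[j] for row in data]
        lo, hi = min(col), max(col)
        width = (hi - lo) / bins or 1.0  # constant column: avoid zero width
        counts = Counter(min(int((v - lo) / width), bins - 1) for v in col)
        entropies.append(-sum((c / n) * math.log2(c / n) for c in counts.values()))
    total = sum(entropies)
    if total == 0:  # every attribute is constant: fall back to uniform weights
        return [1.0 / dims] * dims
    return [h / total for h in entropies]


def weighted_distance(a, b, weights):
    """Attribute-weighted Euclidean distance (an assumed distance form)."""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def knn_dissimilarity(data, k, weights):
    """Score each point by its mean weighted distance to its k nearest neighbors."""
    scores = []
    for i, p in enumerate(data):
        dists = sorted(
            weighted_distance(p, q, weights) for j, q in enumerate(data) if j != i
        )
        scores.append(sum(dists[:k]) / k)
    return scores


def top_r_outliers(data, k=2, r=1, bins=4):
    """Return the indices of the r points with the largest dissimilarity."""
    weights = entropy_weights(data, bins)
    scores = knn_dissimilarity(data, k, weights)
    ranked = sorted(range(len(data)), key=lambda i: scores[i], reverse=True)
    return ranked[:r]


if __name__ == "__main__":
    points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (10, 10)]
    print(top_r_outliers(points, k=2, r=1))
```

This brute-force version compares every pair of points, so it costs quadratic time in the number of points. According to the abstract, the pruning strategy exists to reduce that cost by discarding some inliers before the scoring step.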


Bibliographic details

Source
Intelligent Data Analysis, 2016, Issue 6, pp. 1247-1265 (19 pages)
Authors
Yu Qingying; Luo Yonglong; Chen Chuanming; Bian Weixin
Author affiliations

Anhui Normal Univ, Sch Territorial Resources & Tourism, 189 South Rd Jiuhua Rd, Wuhu 241003, Anhui, Peoples R China|Anhui Normal Univ, Sch Math & Comp Sci, Wuhu, Anhui, Peoples R China;

Anhui Normal Univ, Sch Territorial Resources & Tourism, 189 South Rd Jiuhua Rd, Wuhu 241003, Anhui, Peoples R China|Anhui Normal Univ, Sch Math & Comp Sci, Wuhu, Anhui, Peoples R China;

Anhui Normal Univ, Sch Math & Comp Sci, Wuhu, Anhui, Peoples R China;

Anhui Normal Univ, Sch Math & Comp Sci, Wuhu, Anhui, Peoples R China;

Indexed in: Science Citation Index (SCI)
Original format: PDF
Language: English
Keywords
Outlier detection; information entropy; attribute weights; pruning; k-nearest neighborhood relevant outlier factor (kNNROF);


