首页> 外文会议>IEEE Joint International Information Technology and Artificial Intelligence Conference >Outlier Detection Method based on Improved Two-step Clustering Algorithm and Synthetic Hypothesis Testing

【24h】

Outlier Detection Method based on Improved Two-step Clustering Algorithm and Synthetic Hypothesis Testing

机译：基于改进的两步聚类算法和综合假设检验的异常值检测方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For the detection of outliers in unsupervised mixed attribute datasets, this paper proposes an outlier detection method based on improved two-step clustering algorithm and synthetic hypothesis testing. Firstly, a two-step clustering algorithm based on clustering feature tree is improved to be suitable for mixed attribute datasets. In the clustering stage, RA-Clust algorithm is used to pre-select the clustering center to improve the algorithm performance, and the original complex data distribution is simplified into the superposition of data distribution under sub-clusterings. Then using the method of synthetic hypothesis testing, the outliers are judged according to the change rule of data in the sub-clusterings of sample data. Experimental results show that the algorithm has better clustering performance, and the synthetic hypothesis testing can accurately detect most of the outliers and maintain over 90% of the outlier detect efficiency under different data sizes.

机译：为了检测非监督混合属性数据集中的离群值，本文提出了一种基于改进的两步聚类算法和综合假设检验的离群值检测方法。首先，对基于聚类特征树的两步聚类算法进行了改进，使其适用于混合属性数据集。在聚类阶段，使用RA-Clust算法预先选择聚类中心以提高算法性能，并将原始的复杂数据分布简化为子聚类下数据分布的叠加。然后采用综合假设检验的方法，根据样本数据子类中数据的变化规律判断离群值。实验结果表明，该算法具有较好的聚类性能，综合假设检验可以准确地检测出大多数离群值，并在不同数据量下保持了超过90％的离群值检测效率。

著录项

来源
《IEEE Joint International Information Technology and Artificial Intelligence Conference 》|2019年|915-919|共5页
会议地点
作者
Geyu Huang; Zhiming Zhang; Wenxin Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Testing; Anomaly detection; Data models; Classification algorithms; Linear regression; Probability density function;

机译：聚类算法;测试;异常检测;数据模型;分类算法;线性回归;概率密度函数;

相似文献

外文文献
中文文献
专利

1. Generalised linear model-based algorithm for detection of outliers in environmental data and comparison with semi-parametric outlier detection methods [J] . Martina ?ampulová, Jaroslav Michálek, Ji?í Mou?ka Atmospheric Pollution Research . 2019 ,第4期

机译：基于线性模型的基于线性模型的算法，用于检测环境数据中的异常值和半参数异常检测方法的比较
2. A Refined Rough K-Means Clustering Algorithm based on Minimizing the Effect of Local Outlier Objects to Improve Overlapping Detection [J] . Khaled Ali Othman, Md. Nasir Sulaiman, Norwati Mustapha, Research journal of applied science, engineering and technology . 2017 ,第8期

机译：一种基于最小化局部离群对象影响的改进的粗糙K均值聚类算法，以改善重叠检测
3. Improving hierarchical cluster analysis: A new method with outlier detection and automatic clustering [J] . J.A.S. Almeida, L.M.S. Barbosa, A.A.C.C. Pais, Chemometrics and Intelligent Laboratory Systems . 2007 ,第2期

机译：改进层次聚类分析：具有异常值检测和自动聚类的新方法
4. Outlier Detection Method based on Improved Two-step Clustering Algorithm and Synthetic Hypothesis Testing [C] . Geyu Huang, Zhiming Zhang, Wenxin Yang IEEE Joint International Information Technology and Artificial Intelligence Conference . 2019

机译：基于改进的两步聚类算法和合成假设检测的异常检测方法
5. Multiple hypothesis testing and multiple outlier identification methods. [D] . Yin, Yaling. 2010

机译：多种假设检验和多种离群值识别方法。
6. Data mining application to healthcare fraud detection: a two-step unsupervised clustering method for outlier detection with administrative databases [O] . Michela Carlotta Massi, Francesca Ieva, Emanuele Lettieri 2020

机译：数据挖掘应用于医疗保健欺诈检测：使用管理数据库的异常值检测的两步无监督群集方法
7. An Outlier Detection Approach Based on Improved Self-Organizing Feature Map Clustering Algorithm [O] . Ping Yang, Dan Wang, Zhuojun Wei, 2019

机译：一种基于改进自组织特征映射聚类算法的异常检测方法
8. Validation Test Report for the Improved Synthetic Ocean Profile (ISOP) System, Part I: Synthetic Profile Methods and Algorithm. [R] . Helber, R. W., Townsend, T. L., Barron, C. N., 2013

机译：改进的合成海洋剖面（IsOp）系统的验证测试报告，第一部分：合成剖面方法和算法。

Outlier Detection Method based on Improved Two-step Clustering Algorithm and Synthetic Hypothesis Testing

摘要

著录项

相似文献

相关主题

期刊订阅