...
首页> 外文期刊>Expert Systems >A method for outlier detection based on cluster analysis and visual expert criteria
【24h】

A method for outlier detection based on cluster analysis and visual expert criteria

机译:基于集群分析和视觉专家标准的异常检测方法

获取原文
获取原文并翻译 | 示例

摘要

Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier detection, often as a preliminary step in order to filter out outliers and build more representative models. In this paper, we propose an outlier detection method based on a clustering process. The aim behind the proposal outlined in this paper is to overcome the specificity of many existing outlier detection techniques that fail to take into account the inherent dispersion of domain objects. The outlier detection method is based on four criteria designed to represent how human beings (experts in each domain) visually identify outliers within a set of objects after analysing the clusters. This has an advantage over other clustering-based outlier detection techniques that are founded on a purely numerical analysis of clusters. Our proposal has been evaluated, with satisfactory results, on data (particularly time series) from two different domains: stabilometry, a branch of medicine studying balance-related functions in human beings and electroencephalography (EEG), a neurological exploration used to diagnose nervous system disorders. To validate the proposed method, we studied method outlier detection and efficiency in terms of runtime. The results of regression analyses confirm that our proposal is useful for detecting outlier data in different domains, with a false positive rate of less than 2% and a reliability greater than 99%.
机译:异常值检测是在各种区域中发生的重要问题。异常值是欺诈行为,机械故障,人为错误或简单自然偏差的结果。许多数据挖掘应用程序执行异常值检测,通常是初步步骤,以便过滤输出异常值并构建更多代表性模型。在本文中,我们提出了一种基于聚类过程的异常检测方法。本文概述的提案背后的旨在克服许多现有异常值检测技术的特殊性,该技术未能考虑域对象的固有色散。异常值检测方法基于四个标准,该标准旨在表示人类如何(每个域中的专家)在分析群集后在视觉上识别一组对象中的异常值。这在其他基于聚类的异常值检测技术上具有优势,该技术基于群集的纯粹数值分析。我们的提案已被评估,令人满意的结果,关于来自两个不同领域的数据(特别是时间序列):STABILOMERY,医学分支,研究人类的平衡相关功能(EEG),用于诊断神经系统的神经系统探索障碍。为了验证所提出的方法,我们在运行时研究了方法异常检测和效率。回归分析结果证实,我们的提议对于检测不同域中的异常数据,假阳性率小于2%,可靠性大于99%。

著录项

  • 来源
    《Expert Systems》 |2020年第5期|e12473.1-e12473.23|共23页
  • 作者单位

    Madrid Open Univ UDIMA Engn Sch Dept Comp Sci Madrid Spain;

    Madrid Open Univ UDIMA Engn Sch Dept Comp Sci Madrid Spain;

    Univ Politecn Madrid ETS Ingenieros Informat Campus Montegancedo Madrid Spain;

    Univ Politecn Madrid ETS Ingenieros Informat Campus Montegancedo Madrid Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    clustering; data mining; KDD; outlier detection; visual expert criteria;

    机译:聚类;数据挖掘;KDD;异常值检测;视觉专家标准;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号