首页> 外文会议>European Conference on Advances in Databases and Information Systems >Unsupervised Artificial Neural Networks for Outlier Detection in High-Dimensional Data
【24h】

Unsupervised Artificial Neural Networks for Outlier Detection in High-Dimensional Data

机译:无监督人工神经网络在高维数据中的异常检测

获取原文

摘要

Outlier detection is an important field in data mining. For high-dimensional data the task is particularly challenging because of the so-called 'curse of dimensionality': The notion of neighborhood becomes meaningless, and points typically show their outlying behavior only in subspaces. As a result, traditional approaches are ineffective. Because of the lack of a ground truth in real-world data and of a priori knowledge about the characteristics of potential outliers, outlier detection should be considered an unsupervised learning problem. In this paper, we examine the usefulness of unsupervised artificial neural networks - autoencoders, self-organising maps and restricted Boltzmann machines - to detect outliers in high-dimensional data in a fully unsupervised way. Each of those approaches targets at learning an approximate representation of the data. We show that one can measure the 'outlierness' of objects effectively, by measuring their deviation from the. learned representation. Our experiments show that neural-based approaches outperform the current state of the art in terms of both runtime and accuracy.
机译:离群值检测是数据挖掘中的重要领域。对于高维数据,由于所谓的“维数诅咒”,该任务特别具有挑战性:邻域的概念变得毫无意义,并且点通常仅在子空间中显示其孤立行为。结果,传统方法是无效的。由于在现实世界的数据中缺乏基本事实,并且缺乏有关潜在异常值特征的先验知识,因此异常值检测应被视为无监督学习问题。在本文中,我们研究了无监督人工神经网络(自动编码器,自组织映射和受限Boltzmann机器)在完全无监督的情况下检测高维数据中异常值的有用性。这些方法中的每一种都针对学习数据的近似表示。我们表明,通过测量对象与对象的偏差,可以有效地测量对象的“离群值”。学习的表示形式。我们的实验表明,基于神经的方法在运行时间和准确性方面都优于当前的最新技术水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号