Algorithms for Mining Distance-Based Outliers in Large Datasets

机译：大型数据集中挖掘距离的异常值的算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identification of outliers can lead to the discovery of truly unexpected knowledge in areas such as electronic commerce, credit card fraud, and even the analysis of performance statistics of professional athletes. Existing methods that we have seen for finding outliers in large datasets can only deal efficiently with two dimensions/attributes of a dataset. Here, we study the notion of DB-(Distance-Based) outliers. While we provide formal and empirical evidence showing the usefulness of DB-outliers, we focus on the development of algorithms for computing such outliers.

机译：本文涉及在大型多维数据集中的异常值（例外）。异常值的识别可以导致在电子商务，信用卡欺诈等领域发现真正意外的知识，甚至是专业运动员绩效统计的分析。我们在大型数据集中寻找异常值的现有方法只能有效地处理数据集的两个维度/属性。在这里，我们研究DB-（基于距离）异常值的概念。虽然我们提供了表现为DB异常值的有用性的正式和经验证据，但我们专注于为计算此类异常值的算法的开发。

著录项

来源
《International conference on very large databases》|1998年||共12页
会议地点
作者
Edwin M. Knorr; Raymond T. Ng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类各种专用数据库;
关键词

相似文献

外文文献
中文文献
专利

1. Fast mining of distance-based outliers in high-dimensional datasets [J] . Amol Ghoting, Srinivasan Parthasarathy, Matthew Eric Otey Data Mining and Knowledge Discovery . 2008,第3期

机译：快速挖掘高维数据集中基于距离的离群值
2. Fast mining of distance-based outliers in high-dimensional datasets [J] . Ghoting A, Parthasarathy S, Otey ME Data mining and knowledge discovery . 2008,第3期

机译：高维数据集中基于距离的离群值的快速挖掘
3. Research on Algorithms for Mining Distance-Based Outliers [J] . WANGLizhen, ZOULikun 电子学报：英文版 . 2005,第003期

机译：基于距离的离群值挖掘算法研究
4. Algorithms for Mining Distance-Based Outliers in Large Datasets [C] . Edwin M. Knorr, Raymond T. Ng International conference on very large databases . 1998

机译：大型数据集中挖掘距离的异常值的算法
5. Empirical performance analysis of two algorithms for mining intentional knowledge of distance-based outliers. [D] . Prasanthi, Enbamoorthy. 2005

机译：两种基于距离的离群值的有意知识挖掘算法的实证性能分析。
6. Empirical study of seven data mining algorithms on different characteristics of datasets for biomedical classification applications [O] . Yiyan Zhang, Yi Xin, Qin Li, 2017

机译：七种数据挖掘算法在生物医学分类应用中不同数据集特征的实证研究
7. Fast mining of distance-based outliers in high dimensional datasets [O] . Amol Ghoting, Srinivasan Parthasarathy, Matthew Eric Otey 2006

机译：在高维数据集中快速挖掘基于距离的异常值

Algorithms for Mining Distance-Based Outliers in Large Datasets

摘要

著录项

相似文献

相关主题

期刊订阅