Algorithms for Mining Distance-Based Outliers in Large Datasets

Abstract

This paper deals with finding outliers (exceptions) in large, multidimensional datasets. The identification of outliers can lead to the discovery of truly unexpected knowledge in areas such as electronic commerce, credit card fraud, and even the analysis of performance statistics of professional athletes. Existing methods that we have seen for finding outliers in large datasets can only deal efficiently with two dimensions/attributes of a dataset. Here, we study the notion of DB- (Distance-Based) outliers. While we provide formal and empirical evidence showing the usefulness of DB-outliers, we focus on the development of algorithms for computing such outliers.

Bibliographic record

Source: Proceedings of the Twenty-fourth International Conference on Very Large Databases, New York, NY, USA, 24-27 August 1998 | 1998 | pp. 392-403 | 12 pages
Conference location: New York, NY (US)
Authors: Edwin M. Knorr; Raymond T. Ng
Original format: PDF
Language of text: English
Chinese Library Classification: Automation technology, computer technology

Similar literature

1. Fast mining of distance-based outliers in high-dimensional datasets [J]. Amol Ghoting, Srinivasan Parthasarathy, Matthew Eric Otey. Data Mining and Knowledge Discovery. 2008, No. 3.
2. Fast mining of distance-based outliers in high-dimensional datasets [J]. Ghoting A, Parthasarathy S, Otey ME. Data Mining and Knowledge Discovery. 2008, No. 3.
3. Research on Algorithms for Mining Distance-Based Outliers [J]. WANG Lizhen, ZOU Likun. Acta Electronica Sinica (English edition). 2005, No. 3.
4. Algorithms for Mining Distance-Based Outliers in Large Datasets [C]. Edwin M. Knorr, Raymond T. Ng. International Conference on Very Large Databases. 1998.
5. Empirical performance analysis of two algorithms for mining intentional knowledge of distance-based outliers [D]. Prasanthi, Enbamoorthy. 2005.
6. Empirical study of seven data mining algorithms on different characteristics of datasets for biomedical classification applications [O]. Yiyan Zhang, Yi Xin, Qin Li. 2017.
7. Fast mining of distance-based outliers in high dimensional datasets [O]. Amol Ghoting, Srinivasan Parthasarathy, Matthew Eric Otey. 2006.
