【24h】

OPTICS-OF:identifying local outliers

机译:OPTICS-OF:确定本地异常值

获取原文
获取原文并翻译 | 示例

摘要

For many KDD applications finding the outliers,i.e. the rare events,is more interesting and useful than finding the common cases,e.g. detecting criminal activities in E-commerce.Being an outlier,however,is not just a binary property.Instead,it is a property that applies to a certain degree to each object in a data set,depending on how 'isolated' the object is,with respect to the surrounding clustering structure.In this paper,we formally introduce a new notion of outliers which bases outlier detection on the same theoretical foundation as density-based cluster analysis.Our notion of an outlier is 'local' in the sense that the outlier-de-gree of an object is determined by taking into account the clustering structure in a bounded neighborhood of the object.We demonstrate that this notion of an outlier is more appropriate for detercting different types of outliers than previous approaches,and we also present an algorithm for finding them.Furthermore,we show that by combining the outlier detection with a density-base method to analyze the clustering structure,we can get the outliers almost for free if we already want to perform a cluster analysis on a data set.
机译:对于许多KDD应用程序,发现异常值罕见事件比查找常见案例更有趣和有用。检测异常行为不是一个二进制属性。相反,它是一种属性,该属性在一定程度上适用于数据集中的每个对象,具体取决于对象的“隔离”程度在本文中,我们正式介绍了一种新的离群值概念,该值基于与基于密度的聚类分析相同的理论基础进行离群值检测。离群值的概念是“局部”对象的离群度是通过考虑对象有界邻域中的聚类结构来确定的。我们证明,离群的概念比以前的方法更适合于确定不同类型的离群,并且进一步,我们证明了通过将离群值检测与基于密度的方法相结合来分析聚类结构,如果我们已经想要p,则可以几乎免费获得离群值对数据集进行聚类分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号