首页> 外文期刊>ACM transactions on knowledge discovery from data >Multi-View Low-Rank Analysis with Applications to Outlier Detection
【24h】

Multi-View Low-Rank Analysis with Applications to Outlier Detection

机译:多视图低秩分析及其在异常值检测中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Detecting outliers or anomalies is a fundamental problem in various machine learning and data mining applications. Conventional outlier detection algorithms are mainly designed for single-view data. Nowadays, data can be easily collected from multiple views, and many learning tasks such as clustering and classification have benefited from multi-view data. However, outlier detection from multi-view data is still a very challenging problem, as the data in multiple views usually have more complicated distributions and exhibit inconsistent behaviors. To address this problem, we propose a multi-view low-rank analysis (MLRA) framework for outlier detection in this article. MLRA pursuits outliers from a new perspective, robust data representation. It contains two major components. First, the cross-view low-rank coding is performed to reveal the intrinsic structures of data. In particular, we formulate a regularized rank-minimization problem, which is solved by an efficient optimization algorithm. Second, the outliers are identified through an outlier score estimation procedure. Different from the existing multi-view outlier detection methods, MLRA is able to detect two different types of outliers from multiple views simultaneously. To this end, we design a criterion to estimate the outlier scores by analyzing the obtained representation coefficients. Moreover, we extend MLRA to tackle the multi-view group outlier detection problem. Extensive evaluations on seven UCI datasets, the MovieLens, the USPS-MNIST, and the WebKB datasets demon strate that out approach outperforms several state-of-the-art outlier detection methods.
机译:在各种机器学习和数据挖掘应用程序中,检测异常值或异常是一个基本问题。常规的离群值检测算法主要设计用于单视图数据。如今,可以轻松地从多个视图中收集数据,并且许多学习任务(例如聚类和分类)都受益于多视图数据。但是,由于多视图中的数据通常具有更复杂的分布并且表现出不一致的行为,因此从多视图数据中进行离群值检测仍然是一个非常具有挑战性的问题。为了解决此问题,我们在本文中提出了一种用于异常检测的多视图低秩分析(MLRA)框架。 MLRA从新的角度追求离群值,即可靠的数据表示。它包含两个主要部分。首先,执行交叉视图低秩编码以揭示数据的固有结构。特别是,我们制定了正规化的秩最小化问题,可以通过高效的优化算法来解决。其次,通过离群值评估程序来识别离群值。与现有的多视图离群值检测方法不同,MLRA能够从多个视图中同时检测两种不同类型的离群值。为此,我们设计了一个标准,通过分析获得的表示系数来估计离群值。此外,我们扩展了MLRA以解决多视图组离群值检测问题。对七个UCI数据集(MovieLens,USPS-MNIST和WebKB数据集)的广泛评估表明,这种方法的性能优于几种最新的离群值检测方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号