首页> 外文会议>IEEE International Conference on Machine Learning and Applications >UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data
【24h】

UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

机译:UoI-NMF簇:一种改进的基于零件的噪声数据分解和重构的鲁棒非负矩阵分解算法

获取原文

摘要

With the ever growing collection of large volumes of scientific data, development of interpretable machine learning tools to analyze such data is becoming more important. However, robust, interpretable machine learning tools are lacking, threatening extraction of scientific insight and discovery. Nonnegative Matrix Factorization (NMF) algorithms decompose an m × n nonnegative data matrix A into a k × n basis matrix H and an m × k weight matrix W, such that A ≈ WH, where k is the desired rank. In this paper, we present a novel two stage algorithm, UoI-NMFclusterfor NMF, which is based on three innovations: (i) completely separate bases learning from weight estimation, (ii) learn bases by clustering NMF results across bootstrap resamples of the data, and (iii) use the recently introduced Union of Intersections (UoI) framework to estimate ultra-sparse weights that maximize data reconstruction accuracy. We deploy our algorithm on various synthetic and scientific data to illustrate its performance, with a focus on neuroscience data. Compared to other NMF algorithms, UoI-NMFclusteryields: a) more accurate parts-based decompositions of noisy data, b) a sparse and accurate weight matrix, and c) high accuracy reconstructions of the de-noised data. Together, these improvements enhance the performance and interpretability of NMF application to noisy data, and suggest similar approaches may benefit other matrix decomposition algorithms.
机译:随着大量科学数据的收集不断增长,开发可解释的机器学习工具来分析此类数据变得越来越重要。但是,缺少健壮的,可解释的机器学习工具,这威胁着对科学见解和发现的提取。非负矩阵分解(NMF)算法将m×n非负数据矩阵A分解为k×n基本矩阵H和m×k权重矩阵W,使得A≈WH,其中k是期望的等级。在本文中,我们提出了一种新颖的两阶段算法,UoI-NMF 集群 NMF是基于三项创新的:(i)将基础学习与权重估计完全分开;(ii)通过在数据的自举重采样中对NMF结果进行聚类来学习基础;(iii)使用最近引入的相交联合(UoI) )框架来估算超稀疏的权重,从而最大程度地提高数据重建的准确性。我们将算法部署在各种合成和科学数据上以说明其性能,重点是神经科学数据。与其他NMF算法相比,UoI-NMF 集群 产生:a)噪声数据的基于零件的更准确分解,b)稀疏且准确的权重矩阵,以及c)去噪数据的高精度重构。总之,这些改进提高了NMF应用程序对嘈杂数据的性能和可解释性,并表明类似的方法可能会使其他矩阵分解算法受益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号