首页> 外文会议>IEEE International Conference on Machine Learning and Applications >UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data
【24h】

UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

机译:UOI-NMF集群:一种强大的非环境矩阵分解算法,用于改进基于部分的分解和噪声数据的重建

获取原文

摘要

With the ever growing collection of large volumes of scientific data, development of interpretable machine learning tools to analyze such data is becoming more important. However, robust, interpretable machine learning tools are lacking, threatening extraction of scientific insight and discovery. Nonnegative Matrix Factorization (NMF) algorithms decompose an m × n nonnegative data matrix A into a k × n basis matrix H and an m × k weight matrix W, such that A ≈ WH, where k is the desired rank. In this paper, we present a novel two stage algorithm, UoI-NMFclusterfor NMF, which is based on three innovations: (i) completely separate bases learning from weight estimation, (ii) learn bases by clustering NMF results across bootstrap resamples of the data, and (iii) use the recently introduced Union of Intersections (UoI) framework to estimate ultra-sparse weights that maximize data reconstruction accuracy. We deploy our algorithm on various synthetic and scientific data to illustrate its performance, with a focus on neuroscience data. Compared to other NMF algorithms, UoI-NMFclusteryields: a) more accurate parts-based decompositions of noisy data, b) a sparse and accurate weight matrix, and c) high accuracy reconstructions of the de-noised data. Together, these improvements enhance the performance and interpretability of NMF application to noisy data, and suggest similar approaches may benefit other matrix decomposition algorithms.
机译:随着越来越多的大量科学数据集合,可解释的机器学习工具的开发分析这些数据变得越来越重要。然而,缺乏强大的,可解释的机器学习工具,威胁要提取科学洞察力和发现。非负矩阵分解(NMF)算法将M×N非负数据矩阵A分解为K×N基矩阵H和M×K权重矩阵W,使得≈WH,其中k是所需的秩。在本文中,我们提出了一种新型两级算法,UOI-NMF cluster 对于NMF,基于三个创新:(i)完全独立的基础从体重估计中学习,(ii)通过跨越Bootstrap竞争的NMF结果来学习基础,并且(iii)使用最近引入的交叉协会(UOI )框架来估计超稀疏重量,最大化数据重建精度。我们在各种合成和科学数据上部署了算法,以说明其性能,专注于神经科学数据。与其他NMF算法相比,UOI-NMF cluster 产量:a)更精确的基于零件的噪声数据分解,b)稀疏和精确的重量矩阵,以及C)高精度重建的去噪数据。这些改进在一起,提高了NMF应用于嘈杂数据的性能和可解释性,并且建议类似的方法可以使其他矩阵分解算法有益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号