UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

机译：UoI-NMF簇：一种改进的基于零件的噪声数据分解和重构的鲁棒非负矩阵分解算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the ever growing collection of large volumes of scientific data, development of interpretable machine learning tools to analyze such data is becoming more important. However, robust, interpretable machine learning tools are lacking, threatening extraction of scientific insight and discovery. Nonnegative Matrix Factorization (NMF) algorithms decompose an m × n nonnegative data matrix A into a k × n basis matrix H and an m × k weight matrix W, such that A ≈ WH, where k is the desired rank. In this paper, we present a novel two stage algorithm, UoI-NMF_clusterfor NMF, which is based on three innovations: (i) completely separate bases learning from weight estimation, (ii) learn bases by clustering NMF results across bootstrap resamples of the data, and (iii) use the recently introduced Union of Intersections (UoI) framework to estimate ultra-sparse weights that maximize data reconstruction accuracy. We deploy our algorithm on various synthetic and scientific data to illustrate its performance, with a focus on neuroscience data. Compared to other NMF algorithms, UoI-NMF_clusteryields: a) more accurate parts-based decompositions of noisy data, b) a sparse and accurate weight matrix, and c) high accuracy reconstructions of the de-noised data. Together, these improvements enhance the performance and interpretability of NMF application to noisy data, and suggest similar approaches may benefit other matrix decomposition algorithms.

机译：随着大量科学数据的收集不断增长，开发可解释的机器学习工具来分析此类数据变得越来越重要。但是，缺少健壮的，可解释的机器学习工具，这威胁着对科学见解和发现的提取。非负矩阵分解（NMF）算法将m×n非负数据矩阵A分解为k×n基本矩阵H和m×k权重矩阵W，使得A≈WH，其中k是期望的等级。在本文中，我们提出了一种新颖的两阶段算法，UoI-NMF _{集群
NMF是基于三项创新的：（i）将基础学习与权重估计完全分开;（ii）通过在数据的自举重采样中对NMF结果进行聚类来学习基础;（iii）使用最近引入的相交联合（UoI））框架来估算超稀疏的权重，从而最大程度地提高数据重建的准确性。我们将算法部署在各种合成和科学数据上以说明其性能，重点是神经科学数据。与其他NMF算法相比，UoI-NMF
_{集群
产生：a）噪声数据的基于零件的更准确分解，b）稀疏且准确的权重矩阵，以及c）去噪数据的高精度重构。总之，这些改进提高了NMF应用程序对嘈杂数据的性能和可解释性，并表明类似的方法可能会使其他矩阵分解算法受益。}}

著录项

来源
《IEEE International Conference on Machine Learning and Applications》|2017年|241-248|共8页
会议地点
作者
Shashanka Ubaru; Kesheng Wu; Kristofer E. Bouchard;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Matrix decomposition; Estimation; Feature extraction; Noise measurement; Clustering algorithms; Sparse matrices; Optimization;

机译：矩阵分解;估计;特征提取;噪声测量;聚类算法;稀疏矩阵;优化;

相似文献

外文文献
中文文献
专利

1. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [J] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, Molecules . 2017,第12期

机译：用于癌症基因聚类的鲁棒流形图正则化非负矩阵分解算法
2. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [J] . Zhu Rong, Liu Jin-Xing, Zhang Yuan-Ke, Molecules . 2017,第12期

机译：一种鲁棒歧管图正规化非负矩阵分解算法癌症基因聚类
3. Community Detection Algorithm Based on Nonnegative Matrix Factorization and Improved Density Peak Clustering [J] . Lu Hong, Sang Xiaoshuang, Zhao Qinghua, Quality Control, Transactions . 2020,第期

机译：基于非负矩阵分解的社区检测算法和改进密度峰簇
4. UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data [C] . Shashanka Ubaru, Kesheng Wu, Kristofer E. Bouchard IEEE International Conference on Machine Learning and Applications . 2017

机译：UOI-NMF集群：一种强大的非环境矩阵分解算法，用于改进基于部分的分解和噪声数据的重建
5. Nonnegative matrix factorization: Analysis, algorithms and applications [D] . Prasad, Upendra 2009

机译：非负矩阵分解：分析，算法和应用
6. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [O] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, 2017

机译：用于癌症基因聚类的鲁棒流形图正则化非负矩阵分解算法
7. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [O] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, 2017

机译：一种稳健流形图规则化非癌矩阵因子分解算法用于癌症基因聚类

UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

摘要

著录项

相似文献

相关主题

期刊订阅