UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

机译：UOI-NMF集群：一种强大的非环境矩阵分解算法，用于改进基于部分的分解和噪声数据的重建

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the ever growing collection of large volumes of scientific data, development of interpretable machine learning tools to analyze such data is becoming more important. However, robust, interpretable machine learning tools are lacking, threatening extraction of scientific insight and discovery. Nonnegative Matrix Factorization (NMF) algorithms decompose an m × n nonnegative data matrix A into a k × n basis matrix H and an m × k weight matrix W, such that A ≈ WH, where k is the desired rank. In this paper, we present a novel two stage algorithm, UoI-NMF_clusterfor NMF, which is based on three innovations: (i) completely separate bases learning from weight estimation, (ii) learn bases by clustering NMF results across bootstrap resamples of the data, and (iii) use the recently introduced Union of Intersections (UoI) framework to estimate ultra-sparse weights that maximize data reconstruction accuracy. We deploy our algorithm on various synthetic and scientific data to illustrate its performance, with a focus on neuroscience data. Compared to other NMF algorithms, UoI-NMF_clusteryields: a) more accurate parts-based decompositions of noisy data, b) a sparse and accurate weight matrix, and c) high accuracy reconstructions of the de-noised data. Together, these improvements enhance the performance and interpretability of NMF application to noisy data, and suggest similar approaches may benefit other matrix decomposition algorithms.

机译：随着越来越多的大量科学数据集合，可解释的机器学习工具的开发分析这些数据变得越来越重要。然而，缺乏强大的，可解释的机器学习工具，威胁要提取科学洞察力和发现。非负矩阵分解（NMF）算法将M×N非负数据矩阵A分解为K×N基矩阵H和M×K权重矩阵W，使得≈WH，其中k是所需的秩。在本文中，我们提出了一种新型两级算法，UOI-NMF_{cluster 对于NMF，基于三个创新：（i）完全独立的基础从体重估计中学习，（ii）通过跨越Bootstrap竞争的NMF结果来学习基础，并且（iii）使用最近引入的交叉协会（UOI ）框架来估计超稀疏重量，最大化数据重建精度。我们在各种合成和科学数据上部署了算法，以说明其性能，专注于神经科学数据。与其他NMF算法相比，UOI-NMF_{cluster 产量：a）更精确的基于零件的噪声数据分解，b）稀疏和精确的重量矩阵，以及C）高精度重建的去噪数据。这些改进在一起，提高了NMF应用于嘈杂数据的性能和可解释性，并且建议类似的方法可以使其他矩阵分解算法有益。}}

著录项

来源
《IEEE International Conference on Machine Learning and Applications》|2017年|579|共8页
会议地点
作者
Shashanka Ubaru; Kesheng Wu; Kristofer E. Bouchard;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词
Matrix decomposition; Estimation; Feature extraction; Noise measurement; Clustering algorithms; Sparse matrices; Optimization;

机译：矩阵分解;估计;特征提取;噪声测量;聚类算法;稀疏矩阵;优化;

相似文献

外文文献
中文文献
专利

1. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [J] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, Molecules . 2017,第12期

机译：用于癌症基因聚类的鲁棒流形图正则化非负矩阵分解算法
2. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [J] . Zhu Rong, Liu Jin-Xing, Zhang Yuan-Ke, Molecules . 2017,第12期

机译：一种鲁棒歧管图正规化非负矩阵分解算法癌症基因聚类
3. Community Detection Algorithm Based on Nonnegative Matrix Factorization and Improved Density Peak Clustering [J] . Lu Hong, Sang Xiaoshuang, Zhao Qinghua, Quality Control, Transactions . 2020,第期

机译：基于非负矩阵分解的社区检测算法和改进密度峰簇
4. UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data [C] . Shashanka Ubaru, Kesheng Wu, Kristofer E. Bouchard IEEE International Conference on Machine Learning and Applications . 2017

机译：UoI-NMF簇：一种改进的基于零件的噪声数据分解和重构的鲁棒非负矩阵分解算法
5. Nonnegative matrix factorization: Analysis, algorithms and applications [D] . Prasad, Upendra 2009

机译：非负矩阵分解：分析，算法和应用
6. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [O] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, 2017

机译：用于癌症基因聚类的鲁棒流形图正则化非负矩阵分解算法
7. A Robust Manifold Graph Regularized Nonnegative Matrix Factorization Algorithm for Cancer Gene Clustering [O] . Rong Zhu, Jin-Xing Liu, Yuan-Ke Zhang, 2017

机译：一种稳健流形图规则化非癌矩阵因子分解算法用于癌症基因聚类

UoI-NMF Cluster: A Robust Nonnegative Matrix Factorization Algorithm for Improved Parts-Based Decomposition and Reconstruction of Noisy Data

摘要

著录项

相似文献

相关主题

期刊订阅