首页> 外文期刊>Data mining and knowledge discovery >The Boolean column and column-row matrix decompositions
【24h】

The Boolean column and column-row matrix decompositions

机译:布尔列和列行矩阵分解

获取原文
获取原文并翻译 | 示例
           

摘要

Matrix decompositions are used for many data mining purposes. One of these purposes is to find a concise but interpretable representation of a given data matrix. Different decomposition formulations have been proposed for this task, many of which assume a certain property of the input data ( e. g., nonnegativity) and aim at preserving that property in the decomposition. In this paper we propose new decomposition formulations for binary matrices, namely the Boolean CX and CUR decompositions. They are natural combinations of two previously presented decomposition formulations. We consider also two subproblems of these decompositions and present a rigorous theoretical study of the subproblems. We give algorithms for the decompositions and for the subproblems, and study their performance via extensive experimental evaluation. We show that even simple algorithms can give accurate and intuitive decompositions of real data, thus demonstrating the power and usefulness of the proposed decompositions.
机译:矩阵分解用于许多数据挖掘目的。这些目的之一是找到给定数据矩阵的简洁但可解释的表示形式。已经针对该任务提出了不同的分解公式,其中许多分解公式假设输入数据的特定属性(例如,非负性),并且旨在在分解中保留该属性。在本文中,我们为二进制矩阵提出了新的分解公式,即布尔CX和CUR分解。它们是两种先前提出的分解配方的自然组合。我们还考虑了这些分解的两个子问题,并对这些子问题进行了严格的理论研究。我们给出了分解和子问题的算法,并通过广泛的实验评估来研究其性能。我们证明,即使是简单的算法也可以对真实数据进行准确而直观的分解,从而证明了所提出分解的强大功能和有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号