Mining a Maximum Weighted Set of Disjoint Submatrices

Abstract

The objective of the maximum weighted set of disjoint submatrices problem is to discover K disjoint submatrices that together cover the largest sum of entries of an input matrix. Like the related biclustering problem, it has many practical data-mining applications, such as gene module discovery in bioinformatics. It differs from the maximum-weighted submatrix coverage problem introduced in [6] by the explicit formulation of disjunction constraints: submatrices must not overlap. In other words, each matrix entry must be covered by at most one submatrix. The particular case of K = 1, called the maximal-sum submatrix problem, was successfully tackled with constraint programming in [5]. Unfortunately, the case of K > 1 is more challenging to solve, as the selection of rows cannot be decided in polynomial time solely from the selection of K sets of columns. It can be proved to be NP-hard. We introduce a hybrid column generation approach that uses constraint programming to generate columns. It is compared to a standard mixed integer linear programming (MILP) approach through experiments on synthetic datasets. Overall, column generation finds valuable solutions quickly, while the MILP approach cannot handle a large number of variables and constraints.
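
The definitions in the abstract can be made concrete with a small Python sketch. It is not the authors' column generation method; it only evaluates a candidate solution. The function names (`covered_weight`, `are_disjoint`, `best_rows_for_columns`) and the toy matrix are hypothetical, and each submatrix is assumed to be given as a pair of row indices and column indices. The last helper applies only to K = 1, where the total decomposes row by row once the columns are fixed.

```python
from itertools import combinations


def covered_weight(matrix, submatrices):
    """Sum of the entries covered by a list of (rows, cols) submatrices."""
    return sum(
        matrix[i][j]
        for rows, cols in submatrices
        for i in rows
        for j in cols
    )


def are_disjoint(submatrices):
    """True if every matrix entry is covered by at most one submatrix.

    Two submatrices share an entry exactly when they share at least
    one row and at least one column.
    """
    for (rows_a, cols_a), (rows_b, cols_b) in combinations(submatrices, 2):
        if set(rows_a) & set(rows_b) and set(cols_a) & set(cols_b):
            return False
    return True


def best_rows_for_columns(matrix, cols):
    """K = 1 only: keep the rows whose sum over the chosen columns is positive."""
    return [i for i, row in enumerate(matrix) if sum(row[j] for j in cols) > 0]


# Hypothetical 3 x 4 input matrix and a candidate solution with K = 2.
matrix = [
    [3, 2, -1, -4],
    [4, 1, -2, -3],
    [-5, -1, 2, 6],
]
solution = [([0, 1], [0, 1]), ([2], [2, 3])]

print(are_disjoint(solution), covered_weight(matrix, solution))
```

For K > 1 the row rule no longer applies independently to each column set, because a row chosen for one submatrix may conflict with another submatrix that shares a column; this is the difficulty the abstract points to.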

Bibliographic record

Source
International Conference on Discovery Science, 2019, pp. 18-28 (11 pages)
Authors
Vincent Branders; Guillaume Derval; Pierre Schaus; Pierre Dupont
Original format
PDF
Keywords
Constraint programming; Maximum weighted submatrix; Column generation; Maximum weighted set of disjoint submatrices problem; Bi-cliques; Data-mining

Similar literature

1. Finding Maximum Disjoint Set of Boundary Rectangles With Application to PCB Routing [J]. Amirmahdi Ahmadinejad, Hamid Zarrabi-Zadeh. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 2017, No. 3.
2. Non covered vertices in Fibonacci cubes by a maximum set of disjoint hypercubes [J]. Mollard, Michel. Discrete Applied Mathematics. 2017.
3. Multi-layer Genetic Algorithm for Maximum Disjoint Reliable Set Covers Problem in Wireless Sensor Networks [J]. Abdulhalim Mayyadah F., Attea Baraa A. Wireless Personal Communications: An International Journal. 2015, No. 1.
4. Mining a Maximum Weighted Set of Disjoint Submatrices [C]. Vincent Branders, Guillaume Derval, Pierre Schaus, International Conference on Discovery Science. 2019.
5. Solving Process Planning and Scheduling Problems Using the Concept of Maximum Weighted Independent Set [D]. Sun, Kai. 2020.
6. An exact algorithm for finding cancer driver somatic genome alterations: the weighted mutually exclusive maximum set cover problem [O]. Songjian Lu, Gunasheil Mandava, Gaibo Yan, 2016.
7. Non covered vertices in Fibonacci cubes by a maximum set of disjoint hypercubes [O]. Mollard, Michel. 2016.
