首页> 外文会议>International Conference of Soft Computing and Pattern Recognition >BiSim: A Simple and Efficient Biclustering Algorithm
【24h】

BiSim: A Simple and Efficient Biclustering Algorithm

机译:BISIM:一种简单而有效的BICLUSTING算法

获取原文

摘要

Analysis of gene expression data includes classification of the data into groups and subgroups based on similar expression patterns. Standard clustering methods for the analysis of gene expression data only identifies the global models while missing the local expression patterns. In order to identify the missed patterns biclustering approach has been introduced. Various biclustering algorithms have been proposed by scientists. Among them Binary Inclusion maximal algorithm (BiMax) forms biclusters when applied on a gene expression data through divide and conquer approach. The worst-case running-time complexity of BiMax for matrices containing disjoint biclusters is O(nmb) and for arbitrary matrices is of order O(nmb min{n, m}) where b is the number of all inclusion-maximal biclusters in matrix. In this paper we present an improved algorithm, BiSim, for biclustering which is easy and avoids complex computations as in BiMax. The complexity of our approach is O(n*m) for n genes and m conditions, i.e., a matrix of size n*m. Also it avoids extra computations within the same complexity class and avoids missing of any biclusters.
机译:基因表达数据的分析包括基于类似表达模式的数据分类为组和子组。用于分析基因表达数据的标准聚类方法仅在缺少本地表达式模式时识别全局模型。为了识别错过的模式,介绍了Biclesting方法。科学家已经提出了各种双板算法。其中二进制包含最大算法(Bimax)通过分割和征服方法在基因表达数据上施用时形成双板。包含不相交双板的矩阵的Bimax的最坏情况运行时间复杂度是O(NMB),并且对于任意矩阵是Order O(NMB min {n,m}),其中B是矩阵中的所有包含最大双板的数量。在本文中,我们提出了一种改进的算法BISIM,用于BIClustering,这很容易,避免了BIMAX中的复杂计算。我们的方法的复杂性是N基因和M条件的O(n * m),即大小为n * m的矩阵。此外,它还避免了在相同的复杂性类别中的额外计算,避免丢失任何双板。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号