首页> 外文期刊>International journal of machine learning and cybernetics >Possibilistic biclustering algorithm for discovering value-coherent overlapping δ-biclusters
【24h】

Possibilistic biclustering algorithm for discovering value-coherent overlapping δ-biclusters

机译:发现值相干重叠δ-二类的可能性二类聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

One of the important tools for analyzing gene expression data is biclustering method. It focuses on finding a subset of genes and a subset of experimental conditions that together exhibit coherent behavior. However, most of the existing biclustering algorithms find exclusive biclusters, which is inappropriate in the context of biology. Since biological processes are not independent of each other, many genes may participate in multiple different processes. Hence, nonexclusive biclustering algorithms are required for finding overlapping biclusters. In this regard, a novel possibilistic biclustering algorithm is presented here to find highly overlapping biclusters of larger volume with mean squared residue lower than a predefined threshold. It judiciously incorporates the concept of possibilistic clustering algorithm into biclustering framework. The integration enables efficient selection of highly overlapping coherent biclusters with mean squared residue lower than a given threshold. The detailed formulation of the proposed possibilistic biclustering algorithm, along with a mathematical analysis on the convergence property, is presented. Some quantitative indices are introduced for evaluating the quality of generated biclusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on yeast gene expression data set. In general, the proposed algorithm shows excellent performance at finding patterns in gene expression data.
机译:双基因组分析是分析基因表达数据的重要工具之一。它着重于找到一起表现出一致行为的基因子集和实验条件子集。但是,大多数现有的双聚类算法都可以找到专有的双聚类,这在生物学背景下是不合适的。由于生物学过程彼此不独立,因此许多基因可能参与多个不同的过程。因此,需要非排他的双簇算法来查找重叠的双簇。在这方面,这里提出了一种新的可能的二元聚类算法,以找到更大重叠量的二元聚类,且均方差低于预定阈值。明智地将可能的聚类算法的概念合并到双聚类框架中。通过积分,可以高效选择均方差低于给定阈值的高度重叠的相干双峰。提出了可能的双簇算法的详细公式,以及对收敛性的数学分析。引入了一些定量指标来评估生成的双簇的质量。在酵母基因表达数据集上定性和定量地证明了该算法的有效性以及与其他算法的比较。通常,所提出的算法在寻找基因表达数据中的模式方面表现出出色的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号