首页> 外文期刊>International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms >AN IMPROVED BICLUSTERING METHOD FOR ANALYZING GENE EXPRESSION PROFILES
【24h】

AN IMPROVED BICLUSTERING METHOD FOR ANALYZING GENE EXPRESSION PROFILES

机译:基因表达谱分析的改进的聚类方法

获取原文
获取原文并翻译 | 示例
           

摘要

Microarrays are one of the latest breakthroughs in experimental molecular biology, which provide a powerful tool by which the expression patterns of thousands of genes can be monitored simultaneously and are already producing huge amount of valuable data. The concept of bicluster was introduced by Cheng and Church1 to capture the coherence of a subset of genes and a subset of conditions. A set of heuristic algorithms were also designed to either find one bicluster or a set of biclusters, which consist of iterations of masking null values and discovered biclusters, coarse and fine node deletion, node addition, and the inclusion of inverted data. These heuristics inevitably suffer from some serious drawback. The masking of null values and discovered biclusters with random numbers may result in the phenomenon of random interference which in turn impacts the discovery of high quality biclusters. To address this issue and to further accelerate the biclustering process, we generalize the model of bicluster to incorporate null values and propose a probabilistic algorithm (FLOC) that can discover a set of k possibly overlapping biclusters simultaneously. Furthermore, this algorithm can easily be extended to support additional features that suit different requirements at virtually little cost. Experimental study on the yeast gene expression data shows that the FLOC algorithm can offer substantial improvements over the previously proposed algorithm.
机译:微阵列是实验分子生物学的最新突破之一,微阵列提供了一个强大的工具,通过它可以同时监视数千种基因的表达模式,并且已经产生了大量有价值的数据。 Cheng和Church1引入了双聚类概念,以捕获基因子集和条件子集的一致性。还设计了一组启发式算法,以查找一个或多个双簇,包括掩盖空值和发现的双簇的迭代,粗略和精细节点删除,节点添加以及包含反向数据。这些启发式方法不可避免地遭受一些严重的缺点。用随机数掩盖空值和发现的双峰可能会导致随机干扰现象,进而影响高质量双峰的发现。为了解决此问题并进一步加快双词簇的生成过程,我们推广了双词簇的模型以合并空值,并提出了一种概率算法(FLOC),该算法可以同时发现一组k个可能重叠的双词簇。此外,可以轻松扩展此算法以支持满足不同要求的其他功能,而所需费用却很少。对酵母基因表达数据的实验研究表明,FLOC算法可以对以前提出的算法进行实质性的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号