【24h】

TRICLUSTER

机译:三头肌

获取原文

摘要

In this paper we introduce a novel algorithm called TRICLUSTER, for mining coherent clusters in three-dimensional (3D) gene expression datasets. TRICLUSTER can mine arbitrarily positioned and overlapping clusters, and depending on different parameter values, it can mine different types of clusters, including those with constant or similar values along each dimension, as well as scaling and shifting expression patterns. TRICLUSTER relies on graph-based approach to mine all valid clusters. For each time slice, i.e., a gene×sample matrix, it constructs the range multigraph, a compact representation of all similar value ranges between any two sample columns. It then searches for constrained maximal cliques in this multigraph to yield the set of bi-clusters for this time slice. Then TRICLUSTER constructs another graph using the biclusters (as vertices) from each time slice; mining cliques from this graph yields the final set of triclusters. Optionally, TRICLUSTER merges/deletes some clusters having large overlaps. We present a useful set of metrics to evaluate the clustering quality, and we show that TRICLUSTER can find significant triclusters in the real microarray datasets.
机译:在本文中,我们引入了一种称为TRICLUSTER的新颖算法,用于挖掘三维(3D)基因表达数据集中的相干簇。 TRICLUSTER可以挖掘任意定位和重叠的聚类,并且可以根据不同的参数值来挖掘不同类型的聚类,包括在每个维度上具有恒定或相似值的聚类,以及缩放和移动表达模式。 TRICLUSTER依靠基于图的方法来挖掘所有有效集群。对于每个时间片,即一个基因×样本矩阵,它构造了范围多图,这是任意两个样本列之间所有相似值范围的紧凑表示。然后,它在此多图中搜索受约束的最大集团,以生成该时间片的双集群集。然后,TRICLUSTER使用每个时间片中的双峰(作为顶点)构造另一个图;从该图中的挖掘集团可以得出最终的三角洲。可选地,TRICLUSTER合并/删除一些重叠较大的群集。我们提出了一套有用的指标来评估聚类质量,并且我们表明TRICLUSTER可以在真实的微阵列数据集中找到重要的tricluster。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号