首页> 外文会议>International joint conference on artificial intelligence >Generalized Clustergrams for Overlapping Biclusters
【24h】

Generalized Clustergrams for Overlapping Biclusters

机译:用于重叠双板的概括集群图

获取原文

摘要

Many real-life datasets, such as those produced by gene expression studies, exhibit complex substructures at various levels of granularity and thus do not have unique well-defined numbers of clusters. In such cases, it is important to be able to trace the evolution of the individual clusters as the number of dimensions of the clustering is varied. While the dendrograms produced by bottom-up clustering methods such as hierarchical clustering are very useful for this purpose, the approach is known to produce unreliable clusters due to its instability w.r.t. resampling. Moreover, hierarchical clustering does not apply to overlapping (BI)clusters, such as those obtained in gene expression studies. On the other hand, the instability w.r.t. the initialization of top-down methods, such as k-means, prevents the comparison between clusters obtained at different dimensionalities. In this paper, we present a method for constructing generalized dendrograms for overlapping biclusters, which depict the evolution of the biclusters as their number is varied. An essential ingredient is a stable biclustering method based on positive tensor factorization of a number of nonnegative matrix factorization runs. We apply our approach to a large colon cancer dataset, which shows several distinct subclasses whose dimensional evolution must be carefully analyzed to enable a more meaningful biological interpretation and sub-classification.
机译:许多现实实际数据集,例如由基因表达研究产生的数据集,在各种粒度下表现出复杂的子结构,因此没有具有独特定义的簇。在这种情况下,重要的是能够追踪各种簇的演变,因为随着聚类的尺寸而变化。虽然由自下而上的聚类方法(例如分层聚类)产生的树形图对此目的非常有用,但是已知该方法由于其不稳定而产生不可靠的簇。重新采样。此外,分层聚类不适用于重叠(BI)簇,例如基因表达研究中获得的簇。另一方面,不稳定性W.R.T.自上而下方法的初始化,例如k均值,防止了在不同尺寸的簇之间的比较。在本文中,我们提出了一种用于构建用于重叠双格图的通用树形图的方法,其描绘了与它们的数量变化的双板的演变。基本成分是基于许多非负矩阵分子运行的正张量分解的稳定双褶皱方法。我们将我们的方法应用于大型结肠癌数据集,其显示了几个不同的子类,必须仔细分析尺寸演化以实现更有意义的生物解释和子分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号