首页> 外文会议>Neural Networks and Computational Intelligence >COMPUTATIONAL METHODS FOR IDENTIFYING NUMBER OF CLUSTERS IN GENE EXPRESSION DATA
【24h】

COMPUTATIONAL METHODS FOR IDENTIFYING NUMBER OF CLUSTERS IN GENE EXPRESSION DATA

机译:识别基因表达数据中簇数的计算方法

获取原文
获取原文并翻译 | 示例

摘要

With the advent of microarray technology, there is a growing need to reliably extract biologically significant information from massive gene expression data. Clustering is one of the key steps in analyzing gene expression data by identifying groups of genes that manifest similar expression patterns. Many algorithms for clustering gene expression data have been reported in the literature. However, there has been limited progress on cluster validation and identifying the number of clusters available in gene expression data. In this paper, we investigate the relative merits of four algorithms in clustering two gene expression data sets. The clustering methods we investigated are the poplar self-organizing maps (SOM), adaptive double self-organizing maps (ADSOM), fuzzy c-means (FCM), and model based clustering method. Their corresponding clusters are validated using figure of merit (FOM), a hierarchical tree-based index, Xie-Beni index that gives a measure of compactness and separation of clusters, and an approximation called the Bayesian information criterion (BIC). Our intent is to provide with a useful guide for choosing the appropriate computational method for identification of number of clusters in gene expression data analysis. It was observed that ADSOM outsmarted the three other clustering methods in detecting the number of clusters available in the two gene expression data sets.
机译:随着微阵列技术的出现,越来越需要从大量基因表达数据中可靠地提取具有生物学意义的信息。聚类是通过鉴定表现出相似表达模式的基因组来分析基因表达数据的关键步骤之一。文献中已经报道了许多用于基因表达数据聚类的算法。但是,在簇验证和鉴定基因表达数据中可用簇的数目方面进展有限。在本文中,我们研究了聚类两个基因表达数据集的四种算法的相对优点。我们研究的聚类方法是杨树自组织图(SOM),自适应双自组织图(ADSOM),模糊c均值(FCM)和基于模型的聚类方法。使用品质因数(FOM),基于树的层次结构索引,Xie-Beni索引(可度量群集的紧密度和分离程度)以及称为贝叶斯信息准则(BIC)的近似值,可以验证其对应的群集。我们的目的是为选择合适的计算方法提供有用的指导,以识别基因表达数据分析中的簇数。观察到,在检测两个基因表达数据集中可用的簇数时,ADSOM优于其他三种簇方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号