Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters

Alexander V. Lukashin; Rainer Fuchs

Abstract

Motivation: Cluster analysis of genome-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and samples. In the present paper, we focus on several important issues related to clustering algorithms that have not yet been fully studied.

Results: We describe a simple and robust algorithm for clustering temporal gene expression profiles that is based on the simulated annealing procedure. In general, this algorithm is guaranteed to eventually find the globally optimal distribution of genes over clusters. We introduce an iterative scheme that quantitatively evaluates the optimal number of clusters for each specific data set. The scheme is based on standard approaches used in regular statistical tests. The basic idea is to search for the optimal number of clusters while simultaneously optimizing the distribution of genes over clusters. The efficiency of the proposed algorithm has been evaluated by means of a reverse engineering experiment, that is, a situation in which the correct distribution of genes over clusters is known a priori. This statistically rigorous test has shown that our algorithm places more than 90% of genes into the correct clusters. Finally, the algorithm has been tested on real gene expression data (expression changes during the yeast cell cycle) for which the fundamental patterns of gene expression and the assignment of genes to clusters are well understood from numerous previous studies.

Availability: The source code of the program implementing the algorithm is available upon request from the authors.

Contact: alex_-lukashin@biogen.com
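The abstract names simulated annealing as the clustering procedure but gives no implementation details, so the following is only a minimal sketch of the general technique, not the authors' program. Everything specific in it is an assumption made for illustration: the function names `within_cluster_cost` and `anneal_clusters`, the cost function (sum of squared distances to cluster centroids), the single-gene reassignment move, the Metropolis acceptance rule, and the geometric cooling schedule with its default parameters. A finite run like this one carries no guarantee of reaching the global optimum, and the number of clusters `k` is fixed by hand rather than estimated.

```python
import math
import random


def within_cluster_cost(profiles, labels, k):
    """Sum of squared distances from each profile to its cluster centroid."""
    dim = len(profiles[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for profile, c in zip(profiles, labels):
        counts[c] += 1
        for j, value in enumerate(profile):
            sums[c][j] += value
    cost = 0.0
    for profile, c in zip(profiles, labels):
        # counts[c] >= 1 here because this profile belongs to cluster c.
        centroid = [s / counts[c] for s in sums[c]]
        cost += sum((v - m) ** 2 for v, m in zip(profile, centroid))
    return cost


def anneal_clusters(profiles, k, t_start=1.0, t_end=1e-3, cooling=0.95,
                    moves_per_temp=100, seed=0):
    """Assign each profile to one of k clusters by simulated annealing.

    All parameters are illustrative assumptions, not values from the paper.
    Returns the best labelling seen and its cost.
    """
    rng = random.Random(seed)
    labels = [rng.randrange(k) for _ in profiles]
    cost = within_cluster_cost(profiles, labels, k)
    best_labels, best_cost = labels[:], cost
    t = t_start
    while t > t_end:
        for _ in range(moves_per_temp):
            # Proposed move: reassign one randomly chosen profile.
            i = rng.randrange(len(profiles))
            old, new = labels[i], rng.randrange(k)
            if new == old:
                continue
            labels[i] = new
            new_cost = within_cluster_cost(profiles, labels, k)
            delta = new_cost - cost
            # Metropolis rule: always accept improvements; accept a
            # worse state with probability exp(-delta / t).
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                cost = new_cost
                if cost < best_cost:
                    best_labels, best_cost = labels[:], cost
            else:
                labels[i] = old  # reject the move
        t *= cooling  # geometric cooling schedule
    return best_labels, best_cost


if __name__ == "__main__":
    # Toy temporal profiles: three rising and three falling (made-up data).
    toy = [
        [0.0, 0.5, 1.0, 1.5], [0.1, 0.6, 1.1, 1.4], [0.0, 0.4, 0.9, 1.6],
        [1.5, 1.0, 0.5, 0.0], [1.4, 1.1, 0.6, 0.1], [1.6, 0.9, 0.4, 0.0],
    ]
    print(anneal_clusters(toy, k=2))
```

The starting temperature has to be on the same scale as typical cost changes; if it is much smaller, the search degenerates into greedy descent. Recomputing the full cost after every move keeps the sketch short, but an incremental update would be needed for genome-scale data.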

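Bibliographic details

Source: Bioinformatics, 2001, Issue 4, 10 pages
Authors: Alexander V. Lukashin; Rainer Fuchs
Original format: PDF
Text language: English (eng)
Chinese Library Classification: Biological sciences; Bioengineering (biotechnology)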

Similar literature

1. Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters [J]. Alexander V. Lukashin, Rainer Fuchs. Bioinformatics. 2001, Issue 4
2. TA-clustering: Cluster analysis of gene expression profiles through Temporal Abstractions [J]. Lucia Sacchi, Riccardo Bellazzi, Cristiana Larizza. International Journal of Medical Informatics. 2005, Issue 7-8
3. Clustering of gene expression profiles: creating initialization-independent clusterings by eliminating unstable genes [J]. Wim De Mulder, Martin Kuiper, René Boel. Journal of Integrative Bioinformatics. 2010, Issue 3
4. Evolutionary Clustering Algorithm with Knowledge-Based Evaluation for Fuzzy Cluster Analysis of Gene Expression Profiles [C]. International Conference on Pattern Recognition and Machine Intelligence. 2007
5. Statistical models for clustering dynamic gene expression profiles [D]. Kim, Bong-Rae. 2006
6. Clustering of Gene Expression Data and End-Point Measurements by Simulated Annealing [O]. Pierre R. Bushel
7. Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters [O]. A. V. Lukashin, R. Fuchs. 2001
