首页> 中文期刊>计算机工程与应用 >基因表达时序数据的HMM层次聚类

基因表达时序数据的HMM层次聚类

     

摘要

The use of DNA microarray technology produces a large number of gene expression time series data.Clustering of these data is a significant approach to extract molecular bioinformation hidden in them.In this paper, a Hidden Markov Model-based Hierarchical Clustering (HMM-HC) method is presented to analyze gene expression time series data.Gene expression time series data are preprocessed according to their statistics, including normalizing them and discretizing them.HMMs are used to model the preprocessed data so as to take advantage of the time dependency between different time points in the gene profile.The built HMM models are clustered with hierarchical strategy to achieve clustering of the data.The experimental results show that this method can not only produce high-quality clusters,but also find out the appropriate number of clusters.%DNA微阵列技术的应用产生了大量的基因表达时序数据,对这些数据进行聚类是荻取其中隐含的生物分子信息的一种重要方法.提出了一种基于隐马尔可夫模型(HMM)的层次聚类方法,根据基因表达时序数据的统计特性对其进行标准化和离散化等预处理,用HMM对经过预处理的数据建模以利用基因表达时序数据不同时间点之间的相关性,用层次聚类方法对建立的模型进行聚类.实验结果表明该方法不仅能够产生好的聚类,而且能够确定最优的聚类数.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号