Incorporating heterogeneous biological data sources in clustering gene expression data

Gang-Guo Li; Zheng-Zhi Wang

首页> 中文期刊>健康（英文） >Incorporating heterogeneous biological data sources in clustering gene expression data

Incorporating heterogeneous biological data sources in clustering gene expression data

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, a similarity measure between genes with protein-protein interactions is pro-posed. The chip-chip data are converted into the same form of gene expression data with pear-son correlation as its similarity measure. On the basis of the similarity measures of protein- protein interaction data and chip-chip data, thecombined dissimilarity measure is defined. The combined distance measure is introduced into K-means method, which can be considered as an improved K-means method. The improved K-means method and other three clustering methods are evaluated by a real dataset. Per-formance of these methods is assessed by a prediction accuracy analysis through known gene annotations. Our results show that the improved K-means method outperforms other clustering methods. The performance of the improved K-means method is also tested by varying the tuning coefficients of the combined dissimilarity measure. The results show that it is very helpful and meaningful to incorporate het-erogeneous data sources in clustering gene expression data, and those coefficients for the genome-wide or completed data sources should be given larger values when constructing the combined dissimilarity measure.

著录项

来源
《健康（英文）》|2009年第1期|17-23|共7页
作者
Gang-Guo Li; Zheng-Zhi Wang;
展开▼
作者单位

不详;

展开▼
原文格式 PDF
正文语种 chi
中图分类肿瘤学;
关键词
Statistical; Analysis; Similarity/; Dissimilarity; Measure; Gene; Expression; Data; Clustering; Data; Fusion;

相似文献

中文文献
外文文献
专利

Incorporating heterogeneous biological data sources in clustering gene expression data

摘要

著录项

相似文献

相关主题

期刊订阅