首页> 外文期刊>Bioinformatics >An improved algorithm for clustering gene expression data
【24h】

An improved algorithm for clustering gene expression data

机译:一种改进的基因表达数据聚类算法

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Recent advancements in microarray technology allows simultaneous monitoring of the expression levels of a large number of genes over different time points. Clustering is an important tool for analyzing such microarray data, typical properties of which are its inherent uncertainty, noise and imprecision. In this article, a two-stage clustering algorithm, which employs a recently proposed variable string length genetic scheme and a multiobjective genetic clustering algorithm, is proposed. It is based on the novel concept of points having significant membership to multiple classes. An iterated version of the well-known Fuzzy C-Means is also utilized for clustering. Results: The significant superiority of the proposed two-stage clustering algorithm as compared to the average linkage method, Self Organizing Map (SOM) and a recently developed weighted Chinese restaurant-based clustering method (CRC), widely used methods for clustering gene expression data, is established on a variety of artificial and publicly available real life data sets. The biological relevance of the clustering solutions are also analyzed.
机译:动机:微阵列技术的最新进展允许在不同时间点同时监视大量基因的表达水平。聚类是分析此类微阵列数据的重要工具,其典型特性是其固有的不确定性,噪声和不精确性。本文提出了一种两阶段聚类算法,该算法采用了最近提出的可变字符串长度遗传方案和多目标遗传聚类算法。它基于对多个类别具有显着成员资格的积分的新颖概念。众所周知的Fuzzy C-Means的迭代版本也用于聚类。结果:与平均链接方法,自组织映射(SOM)和最近开发的加权中国餐馆加权聚类方法(CRC)相比,拟议的两阶段聚类算法具有明显的优势,该方法广泛用于基因表达数据的聚类基于各种人工和公开可用的现实生活数据集建立。还分析了聚类解决方案的生物学相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号