首页> 外文会议>Advances in Artificial Intelligence >A Genetic K-means Clustering Algorithm Applied to Gene Expression Data
【24h】

A Genetic K-means Clustering Algorithm Applied to Gene Expression Data

机译:基因表达数据的遗传K-均值聚类算法

获取原文

摘要

One of the current main strategies to understand a biological process at genome level is to cluster genes by their expression data obtained from DNA microarray experiments. The classic K-means clustering algorithm is a deterministic search and may terminate in a locally optimal clustering. In this paper, a genetic K-means clustering algorithm, called GKMCA, for clustering in gene expression datasets is described. GKMCA is a hybridization of a genetic algorithm (GA) and the iterative optimal K-means algorithm (IOKMA). In GKMCA, each individual is encoded by a partition table which uniquely determines a clustering, and three genetic operators (selection, crossover, mutation) and an IOKM operator derived from IOKMA are employed. The superiority of the GKMCA over the IOKMA and over other GA-clustering algorithms without the IOKM operator is demonstrated for two real gene expression datasets.
机译:在基因组水平上理解生物学过程的当前主要策略之一是通过从DNA微阵列实验获得的基因表达数据对基因进行聚类。经典的K均值聚类算法是确定性搜索,可能会以局部最优聚类终止。本文介绍了一种用于基因表达数据集中的遗传K-均值聚类算法,称为GKMCA。 GKMCA是遗传算法(GA)和迭代最优K均值算法(IOKMA)的混合体。在GKMCA中,每个人均由唯一确定聚类的分区表编码,并使用了三个遗传算子(选择,交叉,突变)和从IOKMA派生的IOKM算子。对于两个真实的基因表达数据集,证明了GKMCA优于IOKMA和不具有IOKM运算符的其他GA聚类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号