首页> 外文会议>International Conference on Advances in Information Systems(ADVIS 2004); 20041020-22; Izmir(TR) >Multi-objective Genetic Algorithm Based Clustering Approach and Its Application to Gene Expression Data
【24h】

Multi-objective Genetic Algorithm Based Clustering Approach and Its Application to Gene Expression Data

机译:基于多目标遗传算法的聚类方法及其在基因表达数据中的应用

获取原文
获取原文并翻译 | 示例

摘要

Gene clustering is a common methodology for analyzing similar data based on expression trajectories. Clustering algorithms in general need the number of clusters as a priori, and this is mostly hard to estimate, even by domain experts. In this paper, we use Niched Pareto k-means Genetic Algorithm (GA) for clustering m-RNA data. After running the multi-objective GA, we get the pareto-optimal front that gives alternatives for the optimal number of clusters as a solution set. We analyze the clustering results under two cluster validity techniques commonly cited in the literature, namely DB index and SD index. This gives an idea about ranking the optimal numbers of clusters for each validity index. We tested the proposed clustering approach by conducting experiments using three data sets, namely figure2data, cancer (NCI60) and Leukaemia data. The obtained results are promising; they demonstrate the applicability and effectiveness of the proposed approach.
机译:基因聚类是基于表达轨迹分析相似数据的常用方法。聚类算法通常需要先验的聚类数量,而且即使是领域专家也很难估计。在本文中,我们使用Niched Pareto k-means遗传算法(GA)对m-RNA数据进行聚类。运行多目标GA之后,我们得到了pareto-optimal前端,该前端为解决方案集提供了最佳聚类数的替代方案。我们在文献中通常引用的两种聚类有效性技术(即DB索引和SD索引)下分析聚类结果。这给出了关于对每个有效性指标的最佳聚类数进行排名的想法。我们通过使用三个数据集(即fig2数据,癌症(NCI60)和白血病数据)进行实验,测试了建议的聚类方法。获得的结果是有希望的;他们证明了该方法的适用性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号