首页> 外文会议>2011 IEEE International Conference on Bioinformatics and Biomedicine >Internal Evaluation Measures as Proxies for External Indices in Clustering Gene Expression Data
【24h】

Internal Evaluation Measures as Proxies for External Indices in Clustering Gene Expression Data

机译:内部评估措施作为基因表达数据聚类中外部指标的代表

获取原文

摘要

Several external indices that use information not present in the dataset were shown to be useful for evaluation of representative based clustering algorithms. However, such supervised measures are not directly useful for construction of better clustering algorithms when class labels are not provided. We propose a method for identifying internal cluster evaluation measures that use only information present in the dataset and are related to given external indices. We utilize these internal measures for the construction of representative based clustering algorithms. Both identification and utilization steps of the proposed method are enabled by use of a component-based clustering algorithm design. Experiments on 432 algorithms using gene expression data sets provide evidence that some internal measures could be used as surrogates for external indices proposed in the literature. Moreover, the obtained results suggest that internal measures correlated to selected external indices can guide the algorithms toward significantly better cluster models.
机译:已显示使用数据集中不存在的信息的几个外部索引对于评估基于代表性的聚类算法很有用。但是,当没有提供类标签时,这种监督措施对构建更好的聚类算法没有直接的帮助。我们提出了一种用于识别内部聚类评估指标的方法,该方法仅使用数据集中存在的信息并与给定的外部索引相关。我们利用这些内部措施来构建基于代表性的聚类算法。通过使用基于组件的聚类算法设计,可以实现该方法的识别步骤和使用步骤。使用基因表达数据集对432算法进行的实验提供了证据,表明某些内部度量可用作文献中提出的外部指标的替代。此外,获得的结果表明,与选定的外部索引相关的内部度量可以指导算法朝着明显更好的聚类模型发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号