首页> 外文期刊>Journal of information science and engineering >An Efficient Approach to Identifying and Validating Clusters in Multivariate Datasets with Applications in Gene Expression Analysis
【24h】

An Efficient Approach to Identifying and Validating Clusters in Multivariate Datasets with Applications in Gene Expression Analysis

机译:识别和验证多变量数据集中聚类的有效方法及其在基因表达分析中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Gene expression data analysis has become an important topic in bioinformatics due to its wide application in the biomedical industry. Effective analysis of gene expression data is an essential part of various data mining methods, especially the clustering techniques. Various kinds of clustering methods have been proposed, yet they do not satisfy for the requirements of high efficiency, high quality and automation in the mining of gene expression data. In this paper, we propose an efficient and automatic clustering approach that is suitable for gene expression analysis. The proposed approach primarily employs similarity-matrix based clustering techniques, complemented by new heuristics for reducing the computation cost. In particular, a novel validation technique is incorporated for evaluating the quality of the discovered gene expression patterns. Because it includes empirical evaluation of different gene expression datum, the proposed approach is able perform better than other methods in terms of efficiency, clustering quality and automation.
机译:基因表达数据分析由于其在生物医学行业中的广泛应用,已成为生物信息学中的重要课题。基因表达数据的有效分析是各种数据挖掘方法(尤其是聚类技术)的重要组成部分。已经提出了各种聚类方法,但是它们不满足基因表达数据的挖掘中的高效,高质量和自动化的要求。在本文中,我们提出了一种高效且自动的聚类方法,适用于基因表达分析。所提出的方法主要采用基于相似度矩阵的聚类技术,并辅之以新的启发式方法以减少计算成本。特别地,并入了新颖的验证技术以评估发现的基因表达模式的质量。由于它包括对不同基因表达数据的经验评估,因此在效率,聚类质量和自动化方面,该方法能够比其他方法表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号