首页> 外文会议>2010 4th International Conference on Bioinformatics and Biomedical Engineering >A Novel Clustering Approach Based on the Manifold Structure of Gene Expression Data
【24h】

A Novel Clustering Approach Based on the Manifold Structure of Gene Expression Data

机译:基于基因表达数据流形结构的新型聚类方法

获取原文

摘要

Clustering is an effective approach for computing analysis of gene expression data. Various of clustering algorithms have been developed to give reasonable interpretations of biological data and discover biological meaningful patterns of cellular functions. Based on the manifold structure of gene expression data analyzed under the framework of geometric representation, a novel clustering approach is presented to reveal the nonlinear expression patterns. The novel clustering approach can be divided into the following computing steps. The first step is to construct a neighborhood graph for gene expression points through which the approximate geodesic distances between each two points can be obtained. Then, instead of Euclidean distance, approximate geodesic distance is exploited to reveal the similarity between gene profiles. Finally, via defining the geodesic distance between a cluster and a gene expression point, new clusters can be generated after essential iterative processes. Application of the approach to the yeast cell-cycle dataset validates its rationality and efficiency.
机译:聚类是用于计算基因表达数据分析的有效方法。已经开发了各种聚类算法,以对生物学数据进行合理的解释,并发现细胞功能的生物学有意义的模式。基于在几何表示框架下分析基因表达数据的流形结构,提出了一种新颖的聚类方法来揭示非线性表达模式。新颖的聚类方法可以分为以下计算步骤。第一步是为基因表达点构建邻域图,通过该邻域图可以获得每两个点之间的近似测地距离。然后,代替欧几里得距离,利用近似测地距离来揭示基因谱之间的相似性。最后,通过定义聚类和基因表达点之间的测地距离,可以在必要的迭代过程之后生成新的聚类。该方法在酵母细胞周期数据集上的应用验证了其合理性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号