首页> 外文会议>International Conference on Bio-Inspired Models of Network, Information, and Computing Systems >Clustering Time-Series Gene Expression Data with Unequal Time Intervals
【24h】

Clustering Time-Series Gene Expression Data with Unequal Time Intervals

机译:具有不等时间间隔的聚类时间序列基因表达数据

获取原文

摘要

Clustering gene expression data given in terms of time-series is a challenging problem that imposes its own particular constraints, namely exchanging two or more time points is not possible as it would deliver quite different results, and also it would lead to erroneous biological conclusions. We have focused on issues related to clustering gene expression temporal profiles, and devised a novel algorithm for clustering gene temporal expression profile microarray data. The proposed clustering method introduces the concept of profile alignment which is achieved by minimizing the area between two aligned profiles. The overall pattern of expression in the time-series context is accomplished by applying agglomerative clustering combined with profile alignment, and finding the optimal number of clusters by means of a variant of a clustering index, which can effectively decide upon the optimal number of clusters for a given dataset. The effectiveness of the proposed approach is demonstrated on two well-known datasets, yeast and serum, and corroborated with a set of pre-clustered yeast genes, which show a very high classification accuracy of the proposed method, though it is an unsupervised scheme.
机译:集群中的时间序列的形式给出的基因表达数据被强加自己特定的限制,即交换两个或多个时间点的具有挑战性的问题是不可能的,因为它会带来完全不同的结果,也将导致错误的生物学结论。我们把重点放在群集相关基因的表达时间分布的问题,并制定了聚类基因时空表达谱芯片数据的新算法。所提出的聚类方法介绍了一种通过最小化两个对准的轮廓之间的区域实现轮廓对准​​的概念。表达的在时间序列上下文中的整体图案通过由集群索引的变型中,它可以在集群的最佳数目有效地决定的手段施加凝聚聚类与轮廓对准结合,并且发现簇的最佳数目来完成给定数据集。所提出的方法的有效性证明在两个公知的数据集,酵母和血清,并与一组预先聚集的酵母基因,这表明了该方法的一个非常高的分类准确度,尽管它是一种无监督方案证实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号