首页> 外文期刊>Fuzzy sets and systems >Clustering of unevenly sampled gene expression time-series data
【24h】

Clustering of unevenly sampled gene expression time-series data

机译:不均匀采样的基因表达时间序列数据的聚类

获取原文
获取原文并翻译 | 示例
       

摘要

Time course measurements are becoming a common type of experiment in the use of microarrays. The temporal order of the data and the varying length of sampling intervals are important and should be considered in clustering time-series. However, the shortness of gene expression time-series data limits the use of conventional statistical models and techniques for time-series analysis. To address this problem, this paper proposes the fuzzy short time-series (FSTS) clustering algorithm, which clusters profiles based on the similarity of their relative change of expression level and the corresponding temporal information. One of the major advantages of fuzzy clustering is that genes can belong to more than one group, revealing distinctive features of each gene's function and regulation. Several examples are provided to illustrate the performance of the proposed algorithm. In addition, we present the validation of the algorithm by clustering the genes which define the model profiles in Chu et al. (Science, 282 (1998) 699). The fuzzy c-means, k-means, average linkage hierarchical algorithm and random clustering are compared to the proposed FSTS algorithm. The performance is evaluated with a well-established cluster validity measure proving that the FSTS algorithm has a better performance than the compared algorithms in clustering similar rates of change of expression in successive unevenly distributed time points. Moreover, the FSTS algorithm was able to cluster in a biologically meaningful way the genes defining the model profiles.
机译:时程测量正在成为使用微阵列实验的一种常见类型。数据的时间顺序和采样间隔的变化长度很重要,在聚类时间序列中应予以考虑。然而,基因表达时间序列数据的短缺限制了常规统计模型和技术用于时间序列分析的使用。为了解决这个问题,本文提出了一种模糊短时间序列(FSTS)聚类算法,该算法基于表达水平的相对变化和对应的时间信息的相似性来聚类轮廓。模糊聚类的主要优点之一是基因可以属于多个组,从而揭示每个基因的功能和调控的独特特征。提供了几个示例来说明所提出算法的性能。此外,我们通过聚类定义Chu等人模型轮廓的基因,提出了算法的验证。 (Science,282(1998)699)。将模糊c均值,k均值,平均链接分层算法和随机聚类与所提出的FSTS算法进行了比较。通过建立良好的聚类有效性度量来评估性能,证明在连续相继不均匀分布的时间点中相似表达变化率的聚类中,FSTS算法具有比比较算法更好的性能。而且,FSTS算法能够以生物学上有意义的方式对定义模型图谱的基因进行聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号