首页> 外文期刊>Expert systems with applications >Intelligent System For The Analysis Of Microarray Data Using Principal Components And Estimation Of Distribution Algorithms
【24h】

Intelligent System For The Analysis Of Microarray Data Using Principal Components And Estimation Of Distribution Algorithms

机译:基于主成分和分布算法估计的微阵列数据智能分析系统

获取原文
获取原文并翻译 | 示例
           

摘要

Background: Microarray technology allows to measure the expression of thousands of genes simultaneously, and under tens of specific conditions. Clustering and Biclustering are the main tools to analyze gene expression data obtained from microarray experiments. By grouping together genes with the same behavior across samples, relevant biological knowledge may be extracted. Non-exclusive groupings are required, since a gene may play more than one biological role. Gene Shaving [Hastie, T., et al. (2000). Gene Shaving as a method for identifying distinct sets of genes with similar expression. Genome Biology, J, 1-21 ] is a popular clustering algorithm which looks for coherent clusters of genes with high variance across samples, allowing overlapping among the clusters.rnMethod: In this paper, we present an intelligent system for analyzing microarray data. Our system implements three novel non-exclusive approaches for clustering and biclustering whose aim is to find coherent groups of genes with large between-sample variance: EDA-Clustering and EDA-Biclustering, based on Estimation of Distribution Algorithms (EDA), and Gene-&-Sample Shaving, a biclustering algorithm based on Principal Components Analysis.rnResults: We integrated the three proposed methods into a web-based platform and tested their performance on two real datasets. The obtained results outperform Gene Shaving in terms of quality and size of revealed patterns. Furthermore, our system allows to visualize the results and validate them from a biological point of view by means of the annotations of the Gene Ontology.
机译:背景:微阵列技术可在数十种特定条件下同时测量数千种基因的表达。聚类和双聚类分析是分析从微阵列实验获得的基因表达数据的主要工具。通过将样本中具有相同行为的基因分组在一起,可以提取相关的生物学知识。由于基因可能扮演多个生物学角色,因此需要非排他性分组。基因剃须[Hastie,T.,et al。 (2000)。基因剃须作为一种鉴定具有相似表达的不同基因集的方法。 Genome Biology,J,1-21]是一种流行的聚类算法,该算法寻找样本之间具有高变异性的相关连贯基因簇,从而允许这些簇之间重叠。rn方法:在本文中,我们提出了一种用于分析微阵列数据的智能系统。我们的系统实施了三种新颖的非排他性聚类和双聚类方法,其目的是寻找样本间差异较大的连贯基因组:EDA聚类和EDA双聚类(基于分布算法估算(EDA))以及Gene-结果:我们将三种建议的方法集成到基于Web的平台中,并在两个真实数据集上测试了它们的性能。获得的结果在显示模式的质量和大小方面胜过基因剃须。此外,我们的系统允许可视化结果并通过基因本体论的注释从生物学的角度验证结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号