...
首页> 外文期刊>In silico biology: An international on computational biology >Clustering formal concepts to discover biologically relevant knowledge from gene expression data
【24h】

Clustering formal concepts to discover biologically relevant knowledge from gene expression data

机译:聚类形式概念以从基因表达数据中发现生物学相关知识

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The production of high-throughput gene expression data has generated a crucial need for bioinformatics tools to generate biologically interesting hypotheses. Whereas many tools are available for extracting global patterns, less attention has been focused on local pattern discovery. We propose here an original way to discover knowledge from gene expression data by means of the so-called formal concepts which hold in derived Boolean gene expression datasets. We first encoded the over-expression properties of genes in human cells using human SAGE data. It has given rise to a Boolean matrix from which we extracted the complete collection of formal concepts, i.e., all the largest sets of over-expressed genes associated to a largest set of biological situations in which their over-expression is observed. Complete collections of such patterns tend to be huge. Since their interpretation is a time-consuming task, we propose a new method to rapidly visualize clusters of formal concepts. This designates a reasonable number of Quasi-Synexpression-Groups (QSGs) for further analysis. The interest of our approach is illustrated using human SAGE data and interpreting one of the extracted QSGs. The assessment of its biological relevancy leads to the formulation of both previously proposed and new biological hypotheses.
机译:高通量基因表达数据的产生已经迫切需要生物信息学工具来产生生物学上有趣的假设。尽管有许多工具可用于提取全局模式,但对本地模式发现的关注较少。我们在这里提出一种原始方法,该方法通过所谓的形式概念从基因表达数据中发现知识,这些概念包含在布尔布尔基因表达数据集中。我们首先使用人类SAGE数据编码人类细胞中基因的过表达特性。它产生了一个布尔矩阵,从中我们提取了形式概念的完整集合,即与最大的一组生物学情况相关的所有最大的过表达基因集,其中观察到它们的过表达。这种模式的完整集合往往非常庞大。由于对它们的解释是一项耗时的任务,因此我们提出了一种新方法来快速可视化形式概念的集群。这指定了合理数量的准联指组(QSG)供进一步分析。使用人类SAGE数据并解释提取的QSG之一说明了我们方法的兴趣。对它的生物学相关性的评估导致提出了先前提出的和新的生物学假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号