首页> 外文期刊>BMC Bioinformatics >Query-based biclustering of gene expression data using Probabilistic Relational Models
【24h】

Query-based biclustering of gene expression data using Probabilistic Relational Models

机译:使用概率关系模型的基于查询的基因表达数据二类聚类

获取原文
           

摘要

Background With the availability of large scale expression compendia it is now possible to view own findings in the light of what is already available and retrieve genes with an expression profile similar to a set of genes of interest (i.e., a query or seed set) for a subset of conditions. To that end, a query-based strategy is needed that maximally exploits the coexpression behaviour of the seed genes to guide the biclustering, but that at the same time is robust against the presence of noisy genes in the seed set as seed genes are often assumed, but not guaranteed to be coexpressed in the queried compendium. Therefore, we developed Pro Bic, a query-based biclustering strategy based on Probabilistic Relational Models (PRMs) that exploits the use of prior distributions to extract the information contained within the seed set. Results We applied Pro Bic on a large scale Escherichia coli compendium to extend partially described regulons with potentially novel members. We compared Pro Bic's performance with previously published query-based biclustering algorithms, namely ISA and QDB, from the perspective of bicluster expression quality, robustness of the outcome against noisy seed sets and biological relevance. This comparison learns that Pro Bic is able to retrieve biologically relevant, high quality biclusters that retain their seed genes and that it is particularly strong in handling noisy seeds. Conclusions Pro Bic is a query-based biclustering algorithm developed in a flexible framework, designed to detect biologically relevant, high quality biclusters that retain relevant seed genes even in the presence of noise or when dealing with low quality seed sets.
机译:背景技术随着大规模表达纲要的可获得性,现在有可能根据已有的内容查看自己的发现,并检索具有与一组感兴趣的基因(即查询或种子集)相似的表达谱的基因条件的子集。为此,需要一种基于查询的策略,该策略可最大程度地利用种子基因的共表达行为来指导双聚类,但同时又能有效抵抗种子集中存在有噪点的基因,因为通常假定存在种子基因,但不能保证在查询的摘要中可以共表达。因此,我们开发了Pro Bic,这是一种基于概率关系模型(PRM)的基于查询的双聚类策略,该策略利用先验分布来提取种子集中包含的信息。结果我们在大规模大肠杆菌纲要上应用了Pro Bic,以扩展具有潜在新成员的部分描述的regulon。我们从bicluster表达质量,针对嘈杂种子集的结局稳健性和生物学相关性的角度,将Pro Bic的性能与以前发布的基于查询的biclustering算法(即ISA和QDB)进行了比较。通过比较发现,Pro Bic能够检索保留其种子基因的生物学相关的高质量双聚类,并且在处理嘈杂的种子方面特别强大。结论Pro Bic是在灵活框架中开发的基于查询的双聚类算法,旨在检测与生物学相关的高质量双聚类,即使在有噪声或处理低质量种子集的情况下,这些聚类仍保留相关的种子基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号