Query-based biclustering of gene expression data using Probabilistic Relational Models

Hui Zhao; Lore Cloots; Tim Van den Bulcke; Yan Wu; Riet De Smet; Valerie Storms; Pieter Meysman; Kristof Engelen; Kathleen Marchal

首页> 外文期刊>BMC Bioinformatics >Query-based biclustering of gene expression data using Probabilistic Relational Models

【24h】

Query-based biclustering of gene expression data using Probabilistic Relational Models

机译：使用概率关系模型的基于查询的基因表达数据二类聚类

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background With the availability of large scale expression compendia it is now possible to view own findings in the light of what is already available and retrieve genes with an expression profile similar to a set of genes of interest (i.e., a query or seed set) for a subset of conditions. To that end, a query-based strategy is needed that maximally exploits the coexpression behaviour of the seed genes to guide the biclustering, but that at the same time is robust against the presence of noisy genes in the seed set as seed genes are often assumed, but not guaranteed to be coexpressed in the queried compendium. Therefore, we developed Pro Bic, a query-based biclustering strategy based on Probabilistic Relational Models (PRMs) that exploits the use of prior distributions to extract the information contained within the seed set. Results We applied Pro Bic on a large scale Escherichia coli compendium to extend partially described regulons with potentially novel members. We compared Pro Bic's performance with previously published query-based biclustering algorithms, namely ISA and QDB, from the perspective of bicluster expression quality, robustness of the outcome against noisy seed sets and biological relevance. This comparison learns that Pro Bic is able to retrieve biologically relevant, high quality biclusters that retain their seed genes and that it is particularly strong in handling noisy seeds. Conclusions Pro Bic is a query-based biclustering algorithm developed in a flexible framework, designed to detect biologically relevant, high quality biclusters that retain relevant seed genes even in the presence of noise or when dealing with low quality seed sets.

机译：背景技术随着大规模表达纲要的可获得性，现在有可能根据已有的内容查看自己的发现，并检索具有与一组感兴趣的基因（即查询或种子集）相似的表达谱的基因条件的子集。为此，需要一种基于查询的策略，该策略可最大程度地利用种子基因的共表达行为来指导双聚类，但同时又能有效抵抗种子集中存在有噪点的基因，因为通常假定存在种子基因，但不能保证在查询的摘要中可以共表达。因此，我们开发了Pro Bic，这是一种基于概率关系模型（PRM）的基于查询的双聚类策略，该策略利用先验分布来提取种子集中包含的信息。结果我们在大规模大肠杆菌纲要上应用了Pro Bic，以扩展具有潜在新成员的部分描述的regulon。我们从bicluster表达质量，针对嘈杂种子集的结局稳健性和生物学相关性的角度，将Pro Bic的性能与以前发布的基于查询的biclustering算法（即ISA和QDB）进行了比较。通过比较发现，Pro Bic能够检索保留其种子基因的生物学相关的高质量双聚类，并且在处理嘈杂的种子方面特别强大。结论Pro Bic是在灵活框架中开发的基于查询的双聚类算法，旨在检测与生物学相关的高质量双聚类，即使在有噪声或处理低质量种子集的情况下，这些聚类仍保留相关的种子基因。

著录项

来源
《BMC Bioinformatics》 |2011年第1期|共11页
作者
Hui Zhao; Lore Cloots; Tim Van den Bulcke; Yan Wu; Riet De Smet; Valerie Storms; Pieter Meysman; Kristof Engelen; Kathleen Marchal;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词

相似文献

外文文献
中文文献
专利

1. A probabilistic coevolutionary biclustering algorithm for discovering coherent patterns in gene expression dataset [J] . Je-Gun Joung, Soo-Jin Kim, Soo-Yong Shin, BMC Bioinformatics . 2012,第SUPPLEMENTa17期

机译：在基因表达数据集中发现相干模式的概率协同进化双聚类算法
2. A probabilistic relaxation labeling framework for reducing the noise effect in geometric biclustering of gene expression data [J] . Zhao HY, Chan KL, Cheng LM, Pattern Recognition: The Journal of the Pattern Recognition Society . 2009,第11期

机译：一种概率松弛标记框架，可减少基因表达数据的几何聚类中的噪声影响
3. Biclustering of gene expression data using biclustering iterative signature algorithm and biclustering coherent column [J] . E. Saravana Kumar, K. Vengatesan, R.P. Singh, International journal of biomedical engineering and technology . 2018,第3a4期

机译：使用BICLustering迭代签名算法和双板相干柱的基因表达数据BICLUSTING
4. Query-based biclustering of gene expression data using Probabilistic Relational Models [C] . Hui Zhao, Lore Cloots, Tim Van den Bulcke, Asia-Pacific Bioinformatics Conference . 2012

机译：基于概率关系模型的基于基于基因表达数据的BICLUSTING
5. Relevance-dependent Biclustering Models for Relational Data Analysis [D] . 大濵, 郁 2019

机译：关系数据分析的基于相关性的分类模型
6. Query-based biclustering of gene expression data using Probabilistic Relational Models [O] . Hui Zhao, Lore Cloots, Tim Van den Bulcke, 2011

机译：使用概率关系模型的基于查询的基因表达数据的聚类
7. Query-based biclustering of gene expression data using Probabilistic Relational Models [O] . Zhao, Hui, Cloots, Lore, Van den Bulcke, Tim, 2011

机译：使用概率关系模型的基于查询的基因表达数据的聚类

Query-based biclustering of gene expression data using Probabilistic Relational Models

摘要

著录项

相似文献

相关主题

期刊订阅