首页> 外文期刊>BMC Bioinformatics >High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
【24h】

High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network

机译:具有因果基因调节网络隐性常见原因的高阶动态贝叶斯网络学习

获取原文
           

摘要

Inferring gene regulatory network (GRN) has been an important topic in Bioinformatics. Many computational methods infer the GRN from high-throughput expression data. Due to the presence of time delays in the regulatory relationships, High-Order Dynamic Bayesian Network (HO-DBN) is a good model of GRN. However, previous GRN inference methods assume causal sufficiency, i.e. no unobserved common cause. This assumption is convenient but unrealistic, because it is possible that relevant factors have not even been conceived of and therefore un-measured. Therefore an inference method that also handles hidden common cause(s) is highly desirable. Also, previous methods for discovering hidden common causes either do not handle multi-step time delays or restrict that the parents of hidden common causes are not observed genes. We have developed a discrete HO-DBN learning algorithm that can infer also hidden common cause(s) from discrete time series expression data, with some assumptions on the conditional distribution, but is less restrictive than previous methods. We assume that each hidden variable has only observed variables as children and parents, with at least two children and possibly no parents. We also make the simplifying assumption that children of hidden variable(s) are not linked to each other. Moreover, our proposed algorithm can also utilize multiple short time series (not necessarily of the same length), as long time series are difficult to obtain. We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. Experiment results show that our proposed algorithm can recover the causal GRNs adequately given the incomplete data. Using the limited real expression data and small subnetworks of the YEASTRACT network, we have also demonstrated the potential of our algorithm on real data, though more time series expression data is needed.
机译:推断基因调控网络(GRN)已成为生物信息学中的重要主题。许多计算方法从高通量表达数据推断出GRN。由于监管关系中存在时间延迟,因此高阶动态贝叶斯网络(HO-DBN)是GRN的良好模型。但是,先前的GRN推断方法假定因果关系,即没有未发现的常见原因。这种假设是方便的,但却是不现实的,因为可能甚至还没有想到相关因素,因此无法衡量。因此,非常需要一种能够处理隐藏的常见原因的推理方法。同样,用于发现隐藏的常见原因的先前方法或者不处理多步时间延迟,或者限制了隐藏的常见原因的父母没有被观察到的基因。我们已经开发了一种离散的HO-DBN学习算法,该算法还可以从离散的时间序列表达式数据中推断出隐藏的常见原因,并且对条件分布进行了一些假设,但与以前的方法相比,其限制性较小。我们假设每个隐藏变量仅观察到变量为孩子和父母,至少有两个孩子,可能没有父母。我们还做出简化的假设,即隐藏变量的子代未相互链接。此外,由于难以获得长时间序列,因此我们提出的算法还可以利用多个短时间序列(不一定具有相同的长度)。我们已经使用合成数据对GRN进行了广泛的实验,这些GRN的大小最大为100,最多有10个隐藏节点。实验结果表明,在数据不完整的情况下,本文提出的算法可以适当地恢复因果关系。使用有限的真实表达数据和YEASTRACT网络的小型子网,我们还展示了我们算法在真实数据上的潜力,尽管需要更多的时间序列表达数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号