首页> 外文期刊>Advances in Experimental Medicine and Biology >Predictive minimum description length principle approach to inferring gene regulatory networks
【24h】

Predictive minimum description length principle approach to inferring gene regulatory networks

机译:推断基因调控网络的预测最小描述长度原理方法

获取原文
获取原文并翻译 | 示例
       

摘要

Reverse engineering of gene regulatory networks using information theory models has received much attention due to its simplicity, low computational cost, and capability of inferring large networks. One of the major problems with information theory models is to determine the threshold that defines the regulatory relationships between genes. The minimum description length (MDL) principle has been implemented to overcome this problem. The description length of the MDL principle is the sum of model length and data encoding length. A user-specified fine tuning parameter is used as control mechanism between model and data encoding, but it is difficult to find the optimal parameter. In this work, we propose a new inference algorithm that incorporates mutual information (MI), conditional mutual information (CMI), and predictive minimum description length (PMDL) principle to infer gene regulatory networks from DNA microarray data. In this algorithm, the information theoretic quantities MI and CMI determine the regulatory relationships between genes and the PMDL principle method attempts to determine the best MI threshold without the need of a user-specified fine tuning parameter. The performance of the proposed algorithm is evaluated using both synthetic time series data sets and a biological time series data set (Saccharomyces cerevisiae). The results show that the proposed algorithm produced fewer false edges and significantly improved the precision when compared to existing MDL algorithm.
机译:使用信息论模型进行基因调控网络的逆向工程由于其简单性,低计算成本和推断大型网络的能力而备受关注。信息理论模型的主要问题之一是确定定义基因之间调节关系的阈值。最小描述长度(MDL)原理已被实现以克服此问题。 MDL原理的描述长度是模型长度和数据编码长度的总和。用户指定的微调参数用作模型和数据编码之间的控制机制,但是很难找到最佳参数。在这项工作中,我们提出了一种新的推理算法,该算法结合了互信息(MI),条件互信息(CMI)和预测性最小描述长度(PMDL)原理,可以从DNA微阵列数据推断基因调控网络。在该算法中,信息理论量MI和CMI确定基因之间的调节关系,而PMDL原理方法尝试确定最佳MI阈值,而无需用户指定的微调参数。使用合成时间序列数据集和生物时间序列数据集(酿酒酵母)来评估所提出算法的性能。结果表明,与现有的MDL算法相比,该算法产生的虚假边缘更少,精度大大提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号