【24h】

Inductive Logic Programming for Gene Regulation Prediction

机译:用于基因调控预测的归纳逻辑编程

获取原文
获取原文并翻译 | 示例

摘要

In Table 1, a summary of our experimental results with single, bagged and boosted Tilde decision trees can be found. In the first row, the baseline accuracy of 54.7% indicates that the models clearly improve upon random guessing. The second row shows the reference result by Middendorf et al. [1], where alternating decision trees (ADTs) were applied to a propositional version of the basic data (without the additional predicates). We included the results both with and without the use of FunCat terms. Since the predicates related to protein-protein interactions are not frequently used in the trees, we omitted them altogether in the experiments presented here. Given the same information, boosted decision trees are on par with ADTs (another boosting technique), whereas FunCat terms substantially improve the performance, both in predictive accuracy and compactness. Moreover, it is possible to extract the functional categories affected by theexperimental conditions, together with important transcription factor binding sites and transcription factors for these categories from the induced trees. For a detailed description of the data and a qualitative discussion of the results, we have to refer to the long version of the paper [6].Summing up, we propose a systems biology application of ILP, where the goal is to predict the regulation of a gene under a certain condition from binding site information, the state of regulators, and additional information. We believe that decoding the regulation mechanisms of genes is an exciting new application of learning in logic, requiring data integration from various sources and potentially contributing to a better understanding on a system level.
机译:在表1中,可以找到我们的实验结果的摘要,其中包含单个,袋装和增强的Tilde决策树。在第一行中,基线准确度为54.7%,表明模型在随机猜测后明显改善。第二行显示Middendorf等人的参考结果。 [1],其中将交替决策树(ADT)应用于基本数据的命题版本(没有附加谓词)。无论是否使用FunCat术语,我们都包括了结果。由于与蛋白质-蛋白质相互作用相关的谓词在树中并不经常使用,因此我们在此处介绍的实验中将它们完全省略。给定相同的信息,增强的决策树与ADT(另一种增强技术)相提并论,而FunCat术语在预测准确性和紧凑性方面都大大提高了性能。此外,有可能从实验树中提取受实验条件影响的功能类别,以及这些类别的重要转录因子结合位点和转录因子。对于数据的详细描述和结果的定性讨论,我们必须参考较长的论文[6]。总之,我们提出了ILP的系统生物学应用,其目的是预测调节在一定条件下根据结合位点信息,调节子状态和其他信息确定基因的表达。我们认为,解码基因的调控机制是逻辑学习的令人兴奋的新应用,它需要来自各种来源的数据集成,并可能有助于在系统水平上更好地理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号