首页> 外文会议>International Conference on Applied Mathematics and Approximation Theory >Transformations of Data in Deterministic Modelling of Biological Networks
【24h】

Transformations of Data in Deterministic Modelling of Biological Networks

机译:生物网络确定性建模中数据的转变

获取原文

摘要

The Gaussian graphical model (GGM) is a probabilistic modelling approach used in the system biology to represent the relationship between genes with an undirected graph. In graphical models, the genes and their interactions are denoted by nodes and the edges between nodes. Hereby, in this model, it is assumed that the structure of the system can be described by the inverse of the covariance matrix, Θ, which is also called as the precision, when the observations are formulated via a lasso regression under the multivariate normality assumption of states. There are several approaches to estimate Θ in GGM. The most well-known ones are the neighborhood selection algorithm and the graphical lasso (glasso) approach. On the other hand, the multivariate adaptive regression splines (MARS) is a non-parametric regression technique to model nonlinear and highly dependent data successfully. From previous simulation studies, it has been found that MARS can be a strong alternative of GGM if the model is constructed similar to a lasso model and the interaction terms in the optimal model are ignored to get comparable results with respect to the GGM findings. Moreover, it has been detected that the major challenge in both modelling approaches is the high sparsity of Θ due to the possible non-linear interactions between genes, in particular, when the dimensions of the networks are realistically large. In this study, as the novelty, we suggest the Bernstein operators, namely, Bernstein and Szasz polynomials, in the raw data before any lasso type of modelling and associated inference approaches. Because from the findings via GGM with small and moderately large systems, we have observed that the Bernstein polynomials can increase the accuracy of the estimates. Hence, in this work, we perform these operators firstly into the most well-known inference approaches used in GGM under realistically large networks. Then, we investigate the assessment of these transformations for the MARS modelling as the alternative of GGM again under the same large complexity. By this way, we aim to propose these transformation techniques for all sorts of modellings under the steady-state condition of the protein-protein interaction networks in order to get more accurate estimates without any computational cost. In the evaluation of the results, we compare the precision and F-measures of the simulated datasets.
机译:高斯图形模型(GGM)是在系统生物学用来表示与无向图的基因之间的关系的概率建模方法。在图形模型中,基因和它们之间的相互作用是由节点和节点之间的边表示。由此,在该模型中,假设该系统的结构可以通过协方差矩阵,Θ,这也被称为精度,当观察经由套索回归多变量正态假设下配制的倒数来描述的状态。有几个在GGM估计Θ方法。最为公知的是邻域选择算法和图形套索(glasso)的方法。在另一方面,多元自适应回归样条(MARS)是一种非参数回归技术,以模型非线性,并成功地高度依赖数据。从以前的模拟研究,已发现MARS可以GGM的替代性强的,如果模型构造类似于套索模型和优化模型的相互作用项被忽略得到关于GGM发现类似的结果。而且,它已被检测到,在这两种模型的主要挑战是接近Θ的高稀疏由于基因之间可能存在的非线性的相互作用,特别地,当网络的尺寸是大的现实。在这项研究中,如新颖性,我们建议Bernstein算,即伯恩斯坦和萨斯多项式,在原始数据之前的任何套索类型的建模和相关的推理方法。因为从通过GGM与小型和中等大小的系统的调查结果,我们已经观察到,伯恩斯坦多项式可以提高估值的准确性。因此,在这项工作中,我们执行这些操作员首先成为最知名的推理下,切实大型网络中GGM办法使用。然后,我们研究这些变化对MARS建模为GGM再次在同样大的复杂替代的评估。通过这种方式,我们的目标是为了获得更准确的估计,没有任何计算成本来提出各种的蛋白质 - 蛋白质相互作用网络的稳态条件下modellings这些转换技术。在结果的评价,​​我们比较的精度和模拟数据集的F-措施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号