首页> 外文会议>European conference on machine learning and principles and practice of knowledge discovery in databases >A Constrained 11 Minimization Approach for Estimating Multiple Sparse Gaussian or Nonparanormal Graphical Models
【24h】

A Constrained 11 Minimization Approach for Estimating Multiple Sparse Gaussian or Nonparanormal Graphical Models

机译:用于估计多个稀疏高斯或非超自然图形模型的约束11最小化方法

获取原文

摘要

Identifying context-specific entity networks from aggregated data is an important task, arising often in bioinformatics and neuroimaging applications. Computationally, this task can be formulated as jointly estimating multiple different, but related, sparse undirected graphical models (UGM) from aggregated samples across several contexts. Previous joint-UGM studies have mostly focused on sparse Gaussian graphical models (sGGMs) and can't identify context-specific edge patterns directly. We, therefore, propose a novel approach, SIMULE (detecting Shared and Individual parts of MULtiple graphs Explicitly) to learn multi-UGM via a constrained 11 minimization. SIMULE automatically infers both specific edge patterns that are unique to each context and shared interactions preserved among all the contexts. Through the 11 constrained formulation, this problem is cast as multiple independent subtasks of linear programming that can be solved efficiently in parallel. In addition to Gaussian data, SIMULE can also handle multivariate Nonparanormal data that greatly relaxes the normality assumption that many real-world applications do not follow. We provide a novel theoretical proof showing that SIMULE achieves a consistent result at the rate O (log(Kp)tot). On multiple synthetic datasets and two biomedical datasets, SIMULE shows significant improvement over state-of-the-art multi-sGGM and single-UGM baselines (SIMULE implementation and the used datasets @https://github.com/QData/ SIMULE).
机译:从聚合数据中识别特定于上下文的实体网络是一项重要任务,通常是在生物信息学和神经影像学应用中出现的。从计算上讲,此任务可以公式化为从多个环境中的合计样本共同估计多个不同但相关的稀疏无向图形模型(UGM)。先前的UGM联合研究主要集中在稀疏的高斯图形模型(sGGM),无法直接识别特定于上下文的边缘模式。因此,我们提出了一种新颖的方法SIMULE(明确检测MULtiple图的共享部分和单个部分),以通过约束11最小化来学习多个UGM。 SIMULE会自动推断每个上下文唯一的特定边缘模式,以及所有上下文之间保留的共享交互。通过11个受约束的公式,此问题被转换为线性规划的多个独立子任务,这些任务可以并行有效地解决。除高斯数据外,SIMULE还可以处理多元非超自然数据,这大大放松了许多现实世界应用程序未遵循的正态性假设。我们提供了一种新颖的理论证据,表明SIMULE以O(log(Kp)/ ntot)的速率获得了一致的结果。在多个合成数据集和两个生物医学数据集上,SIMULE相对于最新的多sGGM和单个UGM基线(SIMULE实现和使用的数据集@https://github.com/QData/ SIMULE)显示出显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号