首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Learning Multivariate Distributions by Competitive Assembly of Marginals
【24h】

Learning Multivariate Distributions by Competitive Assembly of Marginals

机译:通过边际竞争装配学习多元分布

获取原文
获取原文并翻译 | 示例
           

摘要

We present a new framework for learning high-dimensional multivariate probability distributions from estimated marginals. The approach is motivated by compositional models and Bayesian networks, and designed to adapt to small sample sizes. We start with a large, overlapping set of elementary statistical building blocks, or “primitives,” which are low-dimensional marginal distributions learned from data. Each variable may appear in many primitives. Subsets of primitives are combined in a Lego-like fashion to construct a probabilistic graphical model; only a small fraction of the primitives will participate in any valid construction. Since primitives can be precomputed, parameter estimation and structure search are separated. Model complexity is controlled by strong biases; we adapt the primitives to the amount of training data and impose rules which restrict the merging of them into allowable compositions. The likelihood of the data decomposes into a sum of local gains, one for each primitive in the final structure. We focus on a specific subclass of networks which are binary forests. Structure optimization corresponds to an integer linear program and the maximizing composition can be computed for reasonably large numbers of variables. Performance is evaluated using both synthetic data and real datasets from natural language processing and computational biology.
机译:我们提出了一个新的框架,用于从估计的边际中学习高维多元概率分布。该方法受成分模型和贝叶斯网络的激励,旨在适应小样本量。我们从大量重叠的基本统计构建块(即“原始”)开始,这是从数据中学到的低维边际分布。每个变量可能出现在许多基元中。基元的子集以类似于Lego的方式组合以构建概率图形模型;只有一小部分原语会参与任何有效的构造。由于可以预先计算基元,因此参数估计和结构搜索是分开的。模型的复杂性受强大的偏见控制。我们使原语适应训练数据的数量,并施加规则以限制将原语合并为可允许的成分。数据的可能性分解为局部增益的总和,对于最终结构中的每个基元,局部增益就一个。我们专注于网络的特定子类,即二进制森林。结构优化对应于整数线性程序,并且可以为大量变量计算最大组成。使用合成数据和来自自然语言处理和计算生物学的真实数据集来评估性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号