首页> 外文会议>Uncertainty in Artificial Intelligence >Interpolating Conditional Density Trees
【24h】

Interpolating Conditional Density Trees

机译:插值条件密度树

获取原文
获取外文期刊封面目录资料

摘要

Joint distributions over many variables are frequently modeled by decomposing them into products of simpler, lower-dimensional conditional distributions, such as in sparsely connected Bayesian networks. However, automatically learning such models can be very computationally expensive when there are many datapoints and many continuous variables with complex nonlinear relationships, particularly when no good ways of decomposing the joint distribution are known o priori. In such situations, previous research has generally focused on the use of discretization techniques in which each continuous variable has a single discretization that is used throughout the entire network. In this paper, we present and compare a wide variety of tree-based algorithms for learning and evaluating conditional density estimates over continuous variables. These trees can be thought of as discretizations that vary according to the particular interactions being modeled; however, the density within a given leaf of the tree need not be assumed constant, and we show that such nonuniform leaf densities lead to more accurate density estimation. We have developed Bayesian network structure-learning algorithms that employ these tree-based conditional density representations, and we show that they can be used to practically learn complex joint probability models over dozens of continuous variables from thousands of datapoints. We focus on finding models that are simultaneously accurate, fast to learn, and fast to evaluate once they are learned.
机译:通常通过将多个变量分解为更简单的低维条件分布的乘积(例如在稀疏连接的贝叶斯网络中)来对多个变量的联合分布进行建模。但是,当有许多数据点和许多具有复杂非线性关系的连续变量时,尤其是在没有已知的分解联合分布的好方法的情况下,自动学习这种模型可能在计算上非常昂贵。在这种情况下,以前的研究通常集中在离散化技术的使用上,其中每个连续变量都具有在整个网络中使用的单个离散化。在本文中,我们提出并比较了多种基于树的算法,用于学习和评估连续变量上的条件密度估计。可以将这些树视为离散化,这些离散化根据所建模的特定交互而有所不同。但是,树的给定叶子内的密度不必假定为常数,并且我们表明,这种不均匀的叶子密度可导致更准确的密度估计。我们已经开发了使用这些基于树的条件密度表示的贝叶斯网络结构学习算法,并且我们证明了它们可以用于从数以千计的数据点上的数十个连续变量中实际学习复杂的联合概率模型。我们专注于找到模型,这些模型同时准确,易于学习并且一旦学习就可以快速评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号