首页> 外文会议>European conference on machine learning and knowledge discovery in databases >Evidence-Based Clustering for Scalable Inference in Markov Logic
【24h】

Evidence-Based Clustering for Scalable Inference in Markov Logic

机译:马尔可夫逻辑中基于证据的可扩展推理聚类

获取原文

摘要

Markov Logic is a powerful representation that unifies first-order logic and probabilistic graphical models. However, scaling-up inference in Markov Logic Networks (MLNs) is extremely challenging. Standard graphical model inference algorithms operate on the prepositional Markov network obtained by grounding the MLN and do not scale well as the number of objects in the real-world domain increases. On the other hand, algorithms which perform inference directly at the first-order level, namely lifted inference algorithms, although more scalable than prepositional algorithms, require the MLN to have specific symmetric structure. Worse still, evidence breaks symmetries, and the performance of lifted inference is the same as prepositional inference (or sometimes worse, due to overhead). In this paper, we propose a general method for solving this "evidence" problem. The main idea in our method is to approximate the given MLN having, say, n objects by an MLN having k objects such that k («) n and the results obtained by running potentially much faster inference on the smaller MLN are as close as possible to the ones obtained by running inference on the larger MLN. We achieve this by finding clusters of "similar" groundings using standard clustering algorithms (e.g., K-means), and replacing all groundings in the cluster by their cluster center. To this end, we develop a novel distance (or similarity) function for measuring the similarity between two groundings, based on the evidence presented to the MLN. We evaluated our approach on many different benchmark MLNs utilizing various clustering and inference algorithms. Our experiments clearly show the generality and scalability of our approach.
机译:马尔可夫逻辑(Markov Logic)是一种强大的表示形式,它将一阶逻辑和概率图形模型结合在一起。但是,马尔可夫逻辑网络(MLN)中的按比例放大推理非常具有挑战性。标准图形模型推论算法在通过将MLN接地而获得的介词马尔可夫网络上运行,并且随着实际域中对象数量的增加而无法很好地扩展。另一方面,直接在第一级执行推理的算法(即提升的推理算法)尽管比介词算法更具可扩展性,但它要求MLN具有特定的对称结构。更糟糕的是,证据破坏了对称性,而推论的表现与介词推论相同(由于开销,有时会更糟)。在本文中,我们提出了一种解决该“证据”问题的通用方法。我们方法的主要思想是通过具有k个对象的MLN来近似给定的MLN(具有n个对象),使得k(«)n以及通过对较小的MLN进行可能更快的推断而获得的结果尽可能接近通过在较大的MLN上进行推理获得的结果。我们通过使用标准聚类算法(例如K-means)找到“相似”接地点的集群,并用其集群中心替换集群中的所有接地点,来实现这一目标。为此,基于提供给MLN的证据,我们开发了一种新颖的距离(或相似性)函数来测量两个接地之间的相似性。我们利用各种聚类和推理算法对许多不同基准MLN的方法进行了评估。我们的实验清楚地表明了我们方法的通用性和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号