首页> 外文会议>International conference on inductive logic programming >Logical Vision: One-Shot Meta-Interpretive Learning from Real Images
【24h】

Logical Vision: One-Shot Meta-Interpretive Learning from Real Images

机译:逻辑视觉:从真实图像中进行一次元诠释学习

获取原文

摘要

Statistical machine learning is widely used in image classification. However, most techniques (1) require many images to achieve high accuracy and (2) do not provide support for reasoning below the level of classification, and so are unable to support secondary reasoning, such as the existence and position of light sources and other objects outside the image. In recent work an Inductive Logic Programming approach called Logical Vision (LV) was shown to overcome some of these limitations. LV uses Meta-Interpretive Learning combined with low-level extraction of high-contrast points sampled from the image to learn recursive logic programs describing the image. This paper extends LV by using (a) richer background knowledge enabling secondary reasoning from raw images, such as light reflection that can itself be learned and used for resolving visual ambiguities, which cannot be easily modelled using statistical approaches, (b) a wider class of background models representing classical 2D shapes such as circles and ellipses, (c) primitive-level statistical estimators to handle noise in real images. Our results indicate that the new noise-robust version of LV is able to handle secondary reasoning task in real images with few data, which is very similar to scientific discovery process of humans. Specifically, it uses a single example (i.e. one-shot LV) converges to an accuracy at least comparable to thirty-shot statistical machine learner on the prediction of hidden light sources. Moreover, we demonstrate that the learned theory can be used to identify ambiguities in the convexity/concavity of objects such as craters.
机译:统计机器学习广泛用于图像分类。但是,大多数技术(1)需要许多图像才能实现高精度,并且(2)在分类级别以下时不提供推理支持,因此无法支持辅助推理,例如光源和其他光源的存在和位置。图像外的物体。在最近的工作中,展示了一种称为逻辑视觉(LV)的归纳逻辑编程方法,可以克服其中的一些局限性。 LV将元解释学习与从图像中采样的高对比度点的低级提取相结合,以学习描述图像的递归逻辑程序。本文通过使用(a)丰富的背景知识来扩展LV,(a)可以从原始图像中进行二次推理,例如光反射本身可以被学习并用于解决视觉歧义,而使用统计方法无法轻松地对其建模,(b)类别更广代表经典2D形状(例如圆形和椭圆形)的背景模型;(c)用于处理真实图像中噪声的原始级统计估计器。我们的结果表明,新的鲁棒抗噪版本的LV能够以很少的数据处理真实图像中的次要推理任务,这与人类的科学发现过程非常相似。具体而言,它使用单个示例(即,单次LV)收敛到至少与三十次统计机器学习者在隐藏光源的预测上可比的精度。此外,我们证明了学习的理论可以用于识别物体(例如陨石坑)的凸度/凹度中的歧义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号