...
首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Write a Classifier: Predicting Visual Classifiers from Unstructured Text
【24h】

Write a Classifier: Predicting Visual Classifiers from Unstructured Text

机译:编写分类器:根据非结构化文本预测视觉分类器

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

People typically learn through exposure to visual concepts associated with linguistic descriptions. For instance, teaching visual object categories to children is often accompanied by descriptions in text or speech. In a machine learning context, these observations motivates us to ask whether this learning process could be computationally modeled to learn visual classifiers. More specifically, the main question of this work is how to utilize purely textual description of visual classes with no training images, to learn explicit visual classifiers for them. We propose and investigate two baseline formulations, based on regression and domain transfer, that predict a linear classifier. Then, we propose a new constrained optimization formulation that combines a regression function and a knowledge transfer function with additional constraints to predict the parameters of a linear classifier. We also propose a generic kernelized models where a kernel classifier is predicted in the form defined by the representer theorem. The kernelized models allow defining and utilizing any two Reproducing Kernel Hilbert Space (RKHS) kernel functions in the visual space and text space, respectively. We finally propose a kernel function between unstructured text descriptions that builds on distributional semantics, which shows an advantage in our setting and could be useful for other applications. We applied all the studied models to predict visual classifiers on two fine-grained and challenging categorization datasets (CU Birds and Flower Datasets), and the results indicate successful predictions of our final model over several baselines that we designed.
机译:人们通常通过接触与语言描述有关的视觉概念来学习。例如,向儿童讲授视觉对象类别通常伴随着文字或语音描述。在机器学习的上下文中,这些观察结果促使我们提出以下问题:是否可以对这种学习过程进行建模以学习视觉分类器。更具体地说,这项工作的主要问题是如何在没有训练图像的情况下利用视觉课的纯文本描述,为他们学习显式的视觉分类器。我们提出并研究了两种基于线性回归和回归的基线公式,它们预测了线性分类器。然后,我们提出了一种新的约束优化公式,该公式将回归函数和知识传递函数与附加约束结合在一起,以预测线性分类器的参数。我们还提出了一个通用的内核化模型,其中以分类器定理定义的形式预测了一个内核分类器。内核化模型允许分别在可视空间和文本空间中定义和利用任何两个再现内核希尔伯特空间(RKHS)内核函数。最后,我们提出了一个基于分布语义的非结构化文本描述之间的内核函数,该函数在我们的设置中显示出优势,并且可能对其他应用程序有用。我们将所有研究的模型应用于两个细粒度且具有挑战性的分类数据集(CU鸟类和花卉数据集)上的视觉分类器预测,结果表明我们在设计的多个基准上成功预测了最终模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号