首页> 外文OA文献 >Multimodal Visual Concept Learning with Weakly Supervised Techniques
【2h】

Multimodal Visual Concept Learning with Weakly Supervised Techniques

机译:与弱监督技术的多式化视觉概念学习

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Despite the availability of a huge amount of video data accompanied bydescriptive texts, it is not always easy to exploit the information containedin natural language in order to automatically recognize video concepts. Towardsthis goal, in this paper we use textual cues as means of supervision,introducing two weakly supervised techniques that extend the Multiple InstanceLearning (MIL) framework: the Fuzzy Sets Multiple Instance Learning (FSMIL) andthe Probabilistic Labels Multiple Instance Learning (PLMIL). The former encodesthe spatio-temporal imprecision of the linguistic descriptions with Fuzzy Sets,while the latter models different interpretations of each description'ssemantics with Probabilistic Labels, both formulated through a convexoptimization algorithm. In addition, we provide a novel technique to extractweak labels in the presence of complex semantics, that consists of semanticsimilarity computations. We evaluate our methods on two distinct problems,namely face and action recognition, in the challenging and realistic setting ofmovies accompanied by their screenplays, contained in the COGNIMUSE database.We show that, on both tasks, our method considerably outperforms astate-of-the-art weakly supervised approach, as well as other baselines.
机译:尽管有大量的视频数据伴随着Bydesifice文本,但它并不总是容易利用自然语言的信息,以便自动识别视频概念。在本文中,我们将文本提示用作监督手段,引入了扩展多个InscanceLearning(MIL)框架的两个弱监管技术:模糊集多实例学习(FSMIL)和概率标签多实例学习(PlmiL)。前一种与模糊集的语言描述的时空视图,而后者模拟了通过凸透化算法配制的概率标签的每个描述的不同解释。此外,我们提供了一种新颖的技术,在存在复杂语义的存在下提取出来的标签,由语义化计算组成。我们在两个不同的问题中评估了我们的方法,即面部和行动识别,在伴随着他们的剧本剧本的挑战和现实的环境中,包含在Cognimuse Database中的剧本。我们在两个任务中表明,我们的方法非常胜过agrate的差价 - 弱势监督的方法以及其他基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号