Despite the availability of a huge amount of video data accompanied by descriptive texts, it is not always easy to exploit the information contained in natural language in order to automatically recognize video concepts. Towards this goal, in this paper we use textual cues as a means of supervision, introducing two weakly supervised techniques that extend the Multiple Instance Learning (MIL) framework: Fuzzy Sets Multiple Instance Learning (FSMIL) and Probabilistic Labels Multiple Instance Learning (PLMIL). The former encodes the spatio-temporal imprecision of the linguistic descriptions with Fuzzy Sets, while the latter models different interpretations of each description's semantics with Probabilistic Labels; both are formulated through a convex optimization algorithm. In addition, we provide a novel technique, based on semantic similarity computations, for extracting weak labels in the presence of complex semantics. We evaluate our methods on two distinct problems, namely face and action recognition, in the challenging and realistic setting of movies accompanied by their screenplays, contained in the COGNIMUSE database. We show that, on both tasks, our method considerably outperforms a state-of-the-art weakly supervised approach, as well as other baselines.
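To make the weak-label extraction step concrete: the abstract does not specify how the semantic similarity is computed, so the sketch below is only an illustrative toy, using bag-of-words cosine similarity between a screenplay description and a small concept vocabulary (the function names, the example concepts, and the threshold value are all hypothetical; the actual method presumably uses a richer semantic similarity measure).

```python
from collections import Counter
from math import sqrt

def cosine_sim(a: str, b: str) -> float:
    """Toy lexical similarity: cosine over bag-of-words term counts."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in set(va) & set(vb))
    na = sqrt(sum(c * c for c in va.values()))
    nb = sqrt(sum(c * c for c in vb.values()))
    return dot / (na * nb) if na and nb else 0.0

def weak_labels(description: str, concepts: list[str], threshold: float = 0.2) -> list[str]:
    """Assign every concept whose similarity to the description clears the
    threshold -- a (possibly noisy) weak label set, in the spirit of the paper."""
    return [c for c in concepts if cosine_sim(description, c) >= threshold]
```

For example, `weak_labels("John walks to the door", ["walk to door", "sit down"])` keeps only the first concept: the overlapping words "to" and "door" push its similarity above the threshold, while "sit down" shares no words with the description.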