International Journal of Computer Vision

Exploiting Privileged Information from Web Data for Action and Event Recognition


Abstract

In conventional approaches to action and event recognition, sufficient labelled training videos are generally required to learn robust classifiers that generalize well to new testing videos. However, collecting labelled training videos is often time-consuming and expensive. In this work, we propose new learning frameworks that train robust classifiers for action and event recognition using freely available web videos as training data. We aim to address three challenging issues: (1) the training web videos are generally associated with rich textual descriptions, which are not available in test videos; (2) the labels of training web videos are noisy and may be inaccurate; (3) the data distributions of training and test videos are often considerably different. To address the first two issues, we propose a new framework called multi-instance learning with privileged information (MIL-PI), together with three new MIL methods, in which we not only exploit the additional textual descriptions of training web videos as privileged information but also explicitly cope with noise in the loose labels of training web videos. When the training and test videos come from different data distributions, we further extend MIL-PI into a new framework called domain adaptive MIL-PI. We also propose three new domain adaptation methods, which additionally reduce the data distribution mismatch between training and test videos. Comprehensive experiments on action and event recognition demonstrate the effectiveness of our proposed approaches.
