Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes

Abstract

This paper introduces a video retrieval tool for the 2020 Video Browser Showdown (VBS). The tool enhances the user's video browsing experience by making full use of a video analysis database constructed prior to the Showdown. The database was built with deep learning-based object detection, scene text detection, scene color detection, audio classification, and relation detection with scene graph generation. It combines visual, textual, and auditory information, broadening what a user can search for beyond visual content alone. In addition, the tool provides a simple, user-friendly interface that lets novice users adapt to it quickly.
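The abstract does not describe the database schema or the query logic, so the following is only a minimal sketch of how a precomputed multimodal index of this kind might be searched. The `ShotRecord` type, its field names, the `search` function, and the sample shots are all hypothetical and are not taken from the paper; the sketch assumes each shot carries sets of detected objects, scene text tokens, colors, audio classes, and scene-graph relation triples.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Set, Tuple

Relation = Tuple[str, str, str]  # (subject, predicate, object), hypothetical format


@dataclass
class ShotRecord:
    """Precomputed annotations for one video shot (all names are assumptions)."""
    shot_id: str
    objects: Set[str] = field(default_factory=set)
    scene_text: Set[str] = field(default_factory=set)
    colors: Set[str] = field(default_factory=set)
    audio_classes: Set[str] = field(default_factory=set)
    relations: Set[Relation] = field(default_factory=set)


def search(
    index: Iterable[ShotRecord],
    objects: Iterable[str] = (),
    scene_text: Iterable[str] = (),
    colors: Iterable[str] = (),
    audio_classes: Iterable[str] = (),
    relations: Iterable[Relation] = (),
) -> List[str]:
    """Return IDs of shots whose annotations contain every requested term."""
    hits = []
    for shot in index:
        if (
            set(objects) <= shot.objects
            and set(scene_text) <= shot.scene_text
            and set(colors) <= shot.colors
            and set(audio_classes) <= shot.audio_classes
            and set(relations) <= shot.relations
        ):
            hits.append(shot.shot_id)
    return hits


# Toy index standing in for the database built before the Showdown.
index = [
    ShotRecord("v1_s3", {"person", "horse"}, set(), {"green"},
               {"speech"}, {("person", "riding", "horse")}),
    ShotRecord("v2_s1", {"person", "horse"}, {"ranch"}, {"brown"},
               {"music"}, {("person", "next to", "horse")}),
]

# Combine a visual cue, an audio cue, and a relation in one query.
result = search(
    index,
    objects=["horse"],
    audio_classes=["speech"],
    relations=[("person", "riding", "horse")],
)
print(result)
```

Storing each modality as a set keeps the query a plain subset test, which illustrates how an audio class or a relation triple can narrow results that an object-only query would leave ambiguous. A real system would more likely use a database and ranked rather than exact matching.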

Bibliographic details

Source: International Conference on Multimedia Modeling, 2020, pp. 803-808 (6 pages)
Authors: Byoungjun Kim; Ji Yea Shim; Minho Park; Yong Man Ro
Original format: PDF
Keywords: Scene graph; Scene text; Audio classification
Indexed: 2022-08-26 13:55:05
