首页> 外文会议>International Conference on Multimedia Modeling >Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes
【24h】

Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes

机译:使用对象关系和关联的音频类进行基于深度学习的视频检索

获取原文

摘要

This paper introduces a video retrieval tool for the 2020 Video Browser Showdown (VBS). The tool enhances the user's video browsing experience by ensuring full use of video analysis database constructed prior to the Showdown. Deep learning based object detection, scene text detection, scene color detection, audio classification and relation detection with scene graph generation methods have been used to construct the data. The data is composed of visual, textual, and auditory information, broadening the scope to which a user can search beyond visual information. In addition, the tool provides a simple and user-friendly interface for novice users to adapt to the tool in little time.
机译:本文介绍了2020 Video Browser Showdown(VBS)的视频检索工具。该工具通过确保充分利用在Showdown之前构建的视频分析数据库来增强用户的视频浏览体验。基于深度学习的对象检测,场景文本检测,场景颜色检测,音频分类和带有场景图生成方法的关系检测已用于构建数据。数据由视觉,文本和听觉信息组成,从而扩大了用户可以搜索的范围,超出了视觉信息。此外,该工具还为新手用户提供了一个简单易用的界面,使他们可以在短时间内适应该工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号